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A STUDY OF PERCEIVED NEED SATISFACTIONS 
IN BOTTOM AND MIDDLE MANAGEMENT JOBS' 


LYMAN W. PORTER ? 


University of California, Berkeley 


Studies of management jobs in industry and 
business have in the past tended to concen- 
trate on the technical aspects of these jobs, 
such as lists of duties, responsibilities, func- 
tions or activities performed, or on the per- 
sonality traits of individuals filling particular 
jobs. Except for such recent investigations as 
those by Triandis (1959a, 1959b), relatively 
few studies have been concerned with how the 
psychological characteristics of management 
jobs are perceived by the individuals in the 
jobs. An understanding of the nature of job 
perceptions held by individuals in manage- 
ment positions would seem to be appropriate 
for the study of organizational problems. In 
many cases, for example, individuals are pro- 
moted within management on the basis of 
their technical qualifications for a job, while 
their performance may in large part depend 
on how well they are able to adjust to such 
psychological aspects of the job as the types 
of motivational rewards received, the various 
pressures encountered, and the perceived ex- 
pectations of superiors and subordinates. In 
short, some individuals may be qualified tech- 
nically for particular management jobs, but 
do not fit the psychological nature of the jobs. 
If management and the individuals them- 
selves knew more about the psychological as- 
pects of jobs, and of the differences between 


‘This study was begun as part of the research 
program of the Institute of Industrial Relations at 
the University of California, Berkeley. It was con- 
tinued under a fellowship granted by the Ford Foun 
dation 

*The assistance of Robert Andrews and Mildred 
M. Henry is gratefully acknowledged. The Institute 
of Social Sciences, University of California, Berkeley, 
contributed to the support of the research assistance 


jobs at different levels or in different parts of 
the organization, promotional and other per- 
sonnel errors might be reduced and organiza- 
tional effectiveness thereby increased. 

The present investigation is concerned with 
one area of management job perceptions, 
namely, need satisfactions. Although a num- 
ber of studies have been carried out on the 
need satisfactions connected with various 
types of workers’ jobs, very few have con- 
centrated on management jobs, particularly 
where the level within the management hier- 
archy has been taken into account. As Haire 
(1959) has pointed out in a recent review 
of psychological problems relevant to busi- 
ness and industry, “one area of motivational 
studies is surprisingly lacking—the study of 
motivation in management motivational 
analysis has not included the various levels 
of management” (p. 187). Studies of motiva- 
tion at any level of the organization have had 
to face the problem of how to name and clas- 
sify various motives and needs. One of the 
most useful systems for the psychologist who 
is studying motivation in the industrial situa- 
tion is the grouping of motives or needs ac- 
cording to a hierarchy of prepotency (e.g., 
Maslow, 1943, 1954). Such a theory states, 
in essence, that there are basic or primary 
needs, such as those for and 
that an individual satisfies (at least 
minimally) first, after which he turns to so- 
called higher-order needs, such as those for 
affiliation, nurturance, and esteem. Finally, if 
the individual has achieved some degree of 
satisfaction of these first-order and middle- 
order needs, he may then spend effort on try- 
ing to satisfy the highest-order need, that of 


food, water, 


sleep 
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self-actualization—the “desire for self-fulfill- 
ment, namely . . . the tendency [for a per- 
son] to become actualized in what he is po- 
tentially . . . the desire to become more and 
more what one is, to become everything that 
one is capable of becoming . . .” (Maslow, 
1954, p. 91). As Maslow has pointed out, this 
hierarchy of needs is not a rigid, fixed order 
that is the same for all individuals. Especially 
in the area of middle-order needs, such as 
various social and esteem needs, the order 
varies somewhat from person to person. How- 
ever, this theoretical approach to motivation 
clearly carries with it the idea that physio- 
logical and safety or security needs are the 
most prepotent needs, and that self-actualiza- 
tion needs are usually the least prepotent for 
most individuals. 

‘This conceptualization of a hierarchy of 
need satisfaction has been used in one form 
or another by various writers (e.g., Argyris, 
1957; Davis, 1957; Haire, 1956, 1959; 
Leavitt, 1958; Smith, 1955; Viteles, 1953) 
in discussions of need satisfactions of indi- 
viduals working in industrial organizations. 
Generally, the point is made that such or- 
ganizations tend to “pay” the worker in 


physical or security need satisfaction areas 
rather than in higher-order areas, such as so- 
cial, esteem, or self-actualization satisfactions. 
Argryis (1957) puts the idea succinctly when 


he says that “in effect . . . organizations are 
willing to pay high wages and provide ade- 
quate seniority if mature adults will, for eight 
hours a day, behave in a less than mature 
manner” (p. 66). The situation for mass pro- 
duction workers is often cited as a prime illus- 
tration of this pattern of need satisfaction 
in large organizations (Argyris, 1957; Haire, 
1956; Walker & Guest, 1952). Thus, this mo- 
tivational reward system has been thought of 
as especially true for nonmanagement produc- 
tion workers, but little attention has been di- 
rected towards the appropriateness of this pic- 
ture for the management part of the organi- 
zation. Haire (1959) has correctly pointed 
out that: “The emphasis on social and ego- 
istic need satisfactions at work has been pri- 
marily on the hourly paid worker .. .” (p. 
187). Argyris, however, referring to these 
types of need satisfaction, makes an assump- 
tion that would apply throughout the whole 


organization. It is his contention that the 
higher an individual is in an organization the 
more able he is to satisfy higher-order needs. 
especially those pertaining to self-actualiza- 
tion (1957, pp. 66, 94). That is, there exists 
a differential opportunity within management 
to satisfy different types of needs. 

In line with the above discussion, the fol- 
lowing specific problems were investigated in 
this study: 


1. Perceived deficiencies in psychological need 
fulfillment 
(a) Differences between bottom- anc mid- 
dle-management levels: 
Are there differences in perceived de- 
ficiencies of need fulfillment? 
In which specific need areas are the 
differences greatest? 
(b) Differences among need areas: 
Within each of the two management 
levels, which need areas produce the 
largest frequencies of perceived de- 
ficiency of need fulfillment? 


. Importance attached to types of psycho- 
logical needs 


(a) Differences between bottom- and mid- 
dle-management levels: 


Are there differences in the importance 
attributed to various types of psy- 
chological needs? 

In which specific need areas are the 
differences greatest? 

(6) Differences among need areas: 

Within each of the two management 
levels, which need areas produce the 
largest frequencies of attributed im- 
portance? 


METHOD 


Ouestionnaire Instrument 


The data for this study were obtained by the ad- 
ministration of a three-section questionnaire. Two of 
the sections concerned information not pertinent to 
the present investigation and will not be reported 
further. The other section listed 18 management- 
position characteristics to be rated, 15 of which were 
concerned with need satisfaction. The information 
obtained from these 15 items constitutes the data to 
be reported here. The printed instructions for this 
part of the questionnaire state (in part) the following: 


ay 3 
ai 
mae 
| 
‘ 
<4 
ik 
| ite 
| 


Need Satisfactions in Management Jobs 


On the following pages of Part I will be listed 
several characteristics or qualities connected with 
management positions. For each such character- 
istic, you will be asked to give three ratings: 

a. How much of the characteristic is there now 

connected with your management position? 

b. How much of the characteristic do you think 

should be connected with your management 
position ? 
How important is this position characteristic 
to you? 


For each of the 15 items the Ss were asked to an- 
swer the above three questions by circling a number 
on a rating scale from 1 to 7, where “low numbers 
represent low or minimum amounts, and high num- 
bers represent high or maximum amounts.” Thus, a 
typical item looked like this on the questionnaire: 

The feeling of self-esteem a person gets from being 
in my management position: 

a) How much is there 

now? 

b) How much should 

there be? 
How important is 
this to me? 


(min) 1 (max) 


(min) 1 (max) 


(min) 1 (max) 


Categories of Needs and Specific Items 


‘ The categories of needs studied in the present in- 
vestigation, along with the specific items used to 
obtain information relevant to each category, are 
given below. The items were arranged randomly in 
the questionnaire, but are here listed systematically 
according to their respective need categories. The cate- 
gories are arranged in approximate hierarchical or- 
der from lowest-order (most prepotent) to highest- 
order (least prepotent). The categories and their hier- 
archical arrangement are in general agreement with 
the classification system used by Maslow (1954), 
with two major exceptions. First, the questionnaire 
did not contain any items relating to the most pre- 
potent needs—physiological needs—since these needs 
are presumably so adequately satisfied for any mana- 
gerial person that questions concerning them would 
appear irrelevant and unnecessary to the respondent 
The second major deviation from Maslow’s system is 
the addition of an “autonomy” category. The items 
included under this category of autonomy are ones 
that in Maslow’s system would be included in the 
“esteem” category; here, however, these items have 
been put into a separate category since it seemed 
that they are logically distinct from items 
that are more commonly associated with the term 
“esteem.” Therefore, the autonomy items have been 
inserted in the hierarchical order of needs between 
the esteem category and the self-actualization cate- 
gory, to which they have some relation. In addition 
to items that could be classified under particular 
need categories, two other items were included that 
seemed relevant to two or more of the categories. 
These two items are listed separately below as non- 
specific items. The categories and their items follow: 


other 


. Security needs 
a. The feeling of security in my management po- 
sition 
. Social needs 
a. The opportunity, in my management position, 
to give help to other people 
b. The opportunity to develop close friendships 
in my management position 
Esteem needs 
a. The feeling of self-esteem a person gets from 
being in my management position 
b. The prestige of my management position in- 
side the company (that is, the regard received 
from others in the company) 
c. The prestige of my management position out- 
side the company (that is, the regard re- 


ceived from others not in the company) 


Autonomy needs 
a. The authority connected with my manage- 
ment position 
b. The opportunity for independent thought and 
action in my management position 
c. The opportunity, in my management position, 
for participation in the setting of goals 
d. The opportunity, in my management position, 
for participation in the determination of meth- 
ods and procedures 
Self-Actualization needs 
a. The opportunity for personal growth and de- 
velopment in my management position 
b. The feeling of self-fulfillment a person gets 
from being in my management position (that 
is, the feeling of being able to use one’s own 
unique capabilities, realizing one’s potentiali- 
ties) 
c. The feeling of worthwhile accomplishment in 
my management position 
Items specific to two or more need categories 
VI. The pay for my management position 
VII. The feeling of being-in-the-know in my man- 
agement position 


Sample 


The sample of respondents consisted of 139 bot- 
tom-, and middle-level management personnel, of 
which 64 were first-level supervisors or foremen, and 
75 were middle-management individuals. (“Bottom 
level” is defined as those who are on the lowest level 
of management in the organization, and who gen- 
erally are the first-line of supervision; in a few cases 
individuals in this classification had no supervisory 
duties, but were considered management individuals 
by the organization employing them, and are there- 
fore included in the “bottom level” classification 
“Middle level” is defined as consisting of those po- 
sitions above the first level of supervision, but be- 
low the vice-presidential, company officer, or major 
departmental-head level.) 

The sample was obtained from three companies, 
which hereafter will be referred to as Companies A, 
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TABLE 1 
CHARACTERISTICS OF SAMPLE BY MANAGEMENT LEVEL 


Characteristics 


NV in Obtained Sample 
Potential V Available 

> Obtained N of Available V 
Median Age (vears) 

Median Seniority (years) 
Educational Level: 


©) having education bevond high school 


B, and C. 
as follows: 
Company A: This company is a large, nationwide 
organization manufacturing consumer container prod- 
ucts. One plant of this company was sampled; this 
plant employs over 1,000 workers and has 75 indi- 
viduals classified as “management.’”’ The management 
response rate was 53% from this company. 

Company B: This company is also a nationwide 
concern. It processes and distributes (with an em- 
phasis on the latter function) a type of food prod- 
uct. One of the two geographical divisions of this 
organization was sampled for this study. This divi- 
sion employs approximately 400 individuals, of which 
76 are classified as “management” (including sales 
supervisory personnel). The response rate from man- 
agement in Company B was 70%. 

Company C: This company is a medium-sized 
utility firm. Two divisions of this corporation, con- 
taining a total of some 600 workers and 77 manage- 
ment personnel, were used to obtain a sample for 
this study. For Company C, the response rate was 

Table 1 presents the characteristics of the obtained 
sample for the three companies combined. This table 
shows that a somewhat higher percentage of middle- 
management individuals responded to the question- 
naire instrument than did those in bottom manage- 
ment. This was probably due in part to the relatively 
higher educational level of the middle managers and 
their consequent ability and willingness to deal with 
written material. Table 1 also shows, however, that 
there was very little difference between the two 
managerial groups of respondents in either median 
age or median seniority. 


Briefly, the companies may be described 


Procedure 


The questionnaires were distributed individually to 
all of the members of management in each company, 
either by company or by United States mail. Ac- 


8 The author wishes to thank the companies that 
agreed to participate in this study, and their man- 
agement personnel who supplied the basic data. 


Management Levels 


Middle 


Management 


Bottom 
Management Total 
139 
228 

61.0 

43.7 

15.9 


58.0 


companying the questionnaire was a memorandum 
from the chief company officer in each plant or divi- 
sion explaining that the company had been asked to 
participate in a research project on management jobs, 
and that the company had agreed to cooperate. In 
this memorandum from the company it was made 
clear that no individual was required to answer the 
questionnaire, but each was strongly urged to do so. 
The memorandum stated that the questionnaire was 
to be filled out anonymously and that individual re- 
sponses would not be made available to the com- 
pany. Along with the questionnaires and the com- 
pany memorandum, each respondent received a 
stamped, self-addressed envelope in which to return 
his completed questionnaire directly to the investi- 
gator at the university. This method of collection 
was used not only to facilitate the mechanics of the 
process, but also to emphasize that this was a re- 
search project being carried out at a university and 
was not a company-sponsored study. 


RESULTS 
Perceived Deficiencies in Need Fulfillment 
Table 2 presents data concerned with the 
differences between perceived amount of pres- 
ent fulfillment of needs and the amount of 
fulfillment Ss believed should be available in 
their positions. The basic data for the table 
were obtained by ascertaining for each S for 
each questionnaire item whether the individual 
checked a higher scale-number for Part b of 
the item (“How much of the characteristic do 
you think should be connected with your man- 
agement position?”’) than for Part a of the 
item (“How much of the characteristic is 
there now connected with your management 
position?”’). Whenever Part b was checked 
higher than Part a, this was termed a “‘de- 
ficiency” in need fulfillment. 
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TABLE 2 
DIFFERENCES BETWEEN MANAGEMENT LEVELS IN PERCENTAGE OF SS 
INDICATING NEED-FULFILLMENT DEFICIENCIES 


‘ 


Bottom 
Memt 
Questionnaire Items" N = 64) 


sec urity needs 


Item I-a 


Social needs 
Item 
II-b 


Average for cutegor 


Esteem needs 
Item IIl-a 


Average for categor 
g 


\utonomy needs 
Item IV- 
IV 


Average for categor 


Self-actualization needs 
Item V-a 
V-b 
Average for catego 
Nonspec ific 
Item VI 
Item VII 


@ For complete wording 


* For method of co 


In Table 2, need fulfillment deficiencies in 
bottom-level management positions are com- 
pared with those in middle-level positions for 
each of the 15 questionnaire items (arranged 
hy need category), and for the five categories 
themselves. The percentages for items were 
computed by taking all of the individuals at 
each management level, without regard to 
company affiliation, and ascertaining the num- 
ber who checked a deficiency. Confidence lev- 
els of differences between the percentages for 
the two management levels on each of the 
15 items were determined by the standard 
method of computing the significance of dif- 
ferences between percentages. The resulting p 


Middle 


p Value for 
Categories 


> Value 


Ditierer for Items 


text in Results 


values for the items are given in the fourth 
column in the table. Percentages for cate- 
gories were computed by averaging the per- 
centages of the individual items in the cate- 
gories. (It should be noted that such averages 
are not necessarily based on a representative 
sampling of possible items within each cate- 
gory, and the obtained average-for-category 
values therefore should be interpreted with 
caution.) Because the category averages are 
based on extremely small Ns of items, the 
standard method of computing significance 
levels of differences between averages could 
not be applied. Therefore, to obtain levels of 
significance for category differences, the fol- 
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lowing procedure was used: for each need 
category each individual received a “score” 
based on the number of deficiencies checked 
for the items in a particular category; thus, 
for the esteem need category, for example, an 
individual could get a score of O—3, since 
there were three items in this category. A dis- 
tribution of scores was drawn up for each 
category, using the total number of Ss with- 
out regard to management level or company, 
and a dichotomy was made as near to the 
median as possible. The percentages of scores 
above the median split (“high scores’) were 
then used in the standard formula for the sig- 
nificance of differences between percentages. 
The resulting p values for the categories are 
presented in the fifth column of Table 2. 

Table 2 shows, first, that except for Items 
\V-a and VI, where the percentages were vir- 
tually equal, deficiencies were more frequently 
indicated for bottom- than for middle-man- 
agement positions. Items approaching or ex- 
ceeding statistical significance were ones con- 
cerning feeling of security, feeling of self- 
esteem, prestige inside the company, prestige 
outside the company, opportunity for inde- 
pendent thought and action, opportunity for 
participation in the setting of goals, and a 
feeling of being informed. These items that 
differentiated between the two management 
levels were, therefore, concentrated in the se- 
curity, esteem, and autonomy categories. 

Secondly, Table 2, as explained above, per- 
mits a direct comparison between the two 
levels of management for each of the five 
categories as a whole, as well as for the in- 
dividual items in the categories. The last 
column in Table 2 shows clearly that the se- 
curity, esteem, and autonomy categories pro- 
duced the greatest differences between bottom 
and middle management with regard to need 
fulfillment deficiencies. For the other two 
categories of social and _ self-actualization 
needs, the differences were in the direction of 
more frequent deficiencies in bottom manage- 
ment, but neither category had differences ap- 
proaching statistical significance. 

The trends evident in Table 2, on the dif- 
ferences between the two management levels 
in regard to need fulfillment deficiencies, are 
present not only when the data are combined 
from all three companies, as in Table 2, but 


also when the data are analyzed company by 
company. Thus, in each of the three indi- 
vidual companies, bottom-management indi- 
viduals more often indicated deficiencies than 
did middle-management personnel, with the 
esteem need area producing the largest differ- 
ence between the two management levels. The 
areas of security and autonomy tended to 
produce the next largest differences between 
the two management levels in all three com- 
panies. 

A third finding evident in Table 2 is that 
there was a trend in both bottom- and mid- 
dle-management levels for the perceived de- 
ficiencies to be indicated more frequently for 
the higher-order need categories (autonomy 
and self-actualization) than for the lower- 
order need categories (security and social 
needs). Only in the bottom-management level 
was there a slight reversal, where the security- 
need item had a higher percentage than the 
average for the two social need items. How- 
ever, both the security and social categories 
in both levels of management had smaller av- 
erages than did the higher-order categories. 
For the particular items and need categories 
used in this study, there appear to be greater 
deficiencies in need fulfillment in the so- 
called higher-order needs, both in bottom- 
and in middle-management positions. Again, 
this trend is clearly present in the separate 
data for each company as well as in the com- 
bined data from all three companies. 


Importance of Needs 


Table 3 presents data on the importance of 
the various types of needs to the individuals 
in lower- and middle-management jobs. The 
data in this table were based on Part c of the 
questionnaire items, “How important is this 
position characteristic to you?” Table 3 is 
comparable in construction to Table 2, and 
presents for each item and category the per- 
centage of individuals in the two management 
levels who checked “maximum” importance 
(i.e., who check 7 on a 7-point scale). 

Table 3 shows that in contrast to the find- 
ings on need fulfillment deficiencies, none of 
the categories as a whole produced significant 
differences between bottom and middle man- 
agement on the importance of need fulfill- 
ment. (The differences between categories 
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TABLE 3 


DIFFERENCES BETWEEN MANAGEMENT LEVELS IN PERCENTAGE OF SS 
CHECKING MAXIMUM IMPORTANCE OF NEEDS 


Cc 


Bottom Mi 


Ment. 
Questionnaire Items* (N = 64) 
Security needs 


Item I-a 


Social needs 
Item II-a 
II-b 


Average for category 


Esteem needs 
Item 
Ill-b 
IIl-c 


Average for category 


\utonomy needs 
Item IV-a 
IV-b 
IV-d 


Average for category 


Self-actualization needs 
Item V-a 
V-b 
Average for categor) 
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were analyzed by the same method as that 
used in connection with Table 2.) Four indi- 
vidual items—two autonomy items, one self- 
actualization item, and the pay item—did pro- 
duce significant differences between the two 
management levels. (Except for the pay item, 
the differences in the significant items were 
due almost entirely to relatively large differ- 
ences found in Company A. Respondents in 
the other two companies contributed very 
little to the size of the differences for these 
items. ) 

Table 3 also shows that in contrast to the 
results on fulfillment deficiencies, the items 


Mgnt. 
Difference 


(N = 75) 
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and categories tending to produce the largest 
indication of maximum importance for both 
levels of management were not solely the 
higher-order needs. As is evident in Table 3, 
the two areas of greatest importance were a 
higher-order need, self-actualization, and a 
lower-order need, security. The other rela- 
tively higher-order needs, especially esteem, 
received the lowest average percentages of 
maximum importance in both management 
levels. Thus, when the importance of need 
satisfaction is considered, respondents in both 
the bottom- and middle-management levels 
did not show a clear trend of increase from 
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lower-order needs to higher-order needs, as 
was the case when deficiencies in need satis- 
faction were considered. The absence of this 
trend for importance is present not only when 
respondents from all three companies are com- 
bined, as in Table 3, but also when the data 
from each company are analyzed separately. 


DIscuSSION 


The results show that for the sample of in- 
dividuals and companies studied, lower-level 
management positions were more likely to 
produce deficiencies in fulfillment of psycho- 
logical needs than were middle-level positions. 
This suggests that there exisis a differential 
opportunity within management to satisfy 
various motivational needs. The greatest dif- 
ferences between the two management levels 
occurred in a lower-order need area, security, 
and in two of the higher-order areas, esteem 
and autonomy. Contrary to expectation, mid- 
dle management was almost as dissatisfied as 
bottom management in the highest-order need 
area, self-actualization. In this area, only a 
relatively small difference was found between 
the two management levels. The differential 
opportunity within management to satisfy 
psychological needs appears to be much more 
prominent in areas other than self-actualiza- 
tion. 

It is essential to consider differences among 
need areas within management levels as well 
as the differences between management levels 
for given need areas. When this is done, the 
largest frequencies of perceived need fulfill- 
ment deficiency in both of the levels of man- 
agement were found to occur in the higher- 
order need areas, those of esteem, autonomy, 


and _ self-actualization. These are the same 
need areas that various writers have indicated 
are least satisfied in nonmanagement produc- 
tion positions. Thus, from the very bottom of 
organizations up through at least two-thirds 
of management, the higher-order needs are 
not being as well-satisfied as the lower-order 
needs. 

To assess the impact of need-fulfillment de- 
ficiencies, not only the size or amount of the 
deficiency must be taken into account, but 
also the importance of the particular need 
area to the individual involved. The relevant 
data that were obtained in this study showed 
no consistent over-all differences between the 
personnel in the two levels of management in 
how important they regarded fulfillment of 
various types of psychological needs. This 
finding contrasted with that for need fulfill- 
ment deficiencies, where definite bottom-mid- 
dle management differences were found in sev- 
eral areas of needs. Within both levels of 
management, security and self-actualization— 
the lowest-order need (of those studied) and 
the highest-order need—were perceived as the 
most important of the five need categories in- 
vestigated, whereas the analysis of need-ful- 
fillment deficiencies revealed that the greatest 
perceived deficiencies occurred in the three 
highest need-orders. 

Combining the results for deficiency with 
those for importance presents a picture which 
is summarized in Table 4. (The terms used 
in the body of Table 4 represent the approxi- 
mate relative degrees of deficiency and im- 
portance for the five need categories within 
each management level.) In both lower and 
middle management the most critical of the 
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SUMMARY OF RELATIVE 


IMPORTANCE 


Bottom Management 


Relative 
Need Categories Deficiency 
Security moderate 
Social small 
Esteem large 
Autonomy 


Self-Actualization 


large 


large 


Relative 
Importance 


moderate 


moderate 


NEED-FULFILLMENT DEFICIENCY AND NEED 
WITHIN MANAGEMENT LEVELS 


Middle Management 


Relative 
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Relative 
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small small small 
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need hierarchy areas is the area of self-actuali- 
zation because individuals in both levels of 
management considered it to be of prime im- 
portance and the area where there were the 
greatest deficiencies in need fulfillment. The 
next most critical areas for bottom manage- 
ment are security, which was considered of 
major importance and to have relatively mod- 
erate deficiency in fulfillment, and autonomy, 
which had a relatively large deficiency and 
was seen as of moderate importance. Esteem 
and social needs appear to be the least criti- 
cal need areas for persons in bottom-manage- 
ment positions. The pattern for the combined 
deficiency and importance results in middle 
management is somewhat similar to the pat- 
tern in bottom management. Security ranks 
below: self-actualization as a critical area for 
‘middle management because although the size 
of the perceived deficiency was relatively 
small for positions at that level, the impor- 
tance was relatively large. Even in middle 
management, however, security ranked above 
the autonomy, esteem, and social areas in the 
combined effect produced by deficiency and 
importance. 

Two of the items contained in the ques- 
tionnaire have so far been omitted from the 
discussion because they do not fit neatly into 
any of the five need categories specifically 
covered in this study. These are Items VI and 
VII, having to do with pay and with a feel- 
ing of being informed, respectively. These 
items have been considered separately because 
each seems to overlap into two or more need 
categories. The amount of pay one receives in 
his work would seem to satisfy both security 
and esteem needs, and is also a means of satis- 
fying primary physiological needs that could 
not be appropriately studied in this investiga- 
tion. Likewise. the need to be informed would 
seem to satisfy social, esteem, and perhaps 
autonomy needs. Even though neither of these 
items could be related to only one specific 
need fulfillment category, each is obviously 
important to consider since they are so inti- 
mately a part of a person’s thoughts and feel- 
ings about the job and since they provide 
multiple psychological satisfaction. The pay 
item was found to have the highest frequency 
of responses indicating deficiency in fulfill- 
ment in both levels of management. It was 
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regarded as having major importance by lower- 
management individuals, and relatively mod- 
erate importance by middle-management per- 
sonnel. Thus, pay, perhaps because it does 
satisfy several types of needs, is a crucial 
item for both lower-management and middle- 
management individuals. The feeling of being 
informed, as measured by Item VII in this 
study, produced the second most frequent in- 
dication of need fulfillment deficiency in both 
lower- and middle-management positions, al- 
though its frequency in middle management 
was significantly below that for the lower 
level. In terms of importance, it received 
relatively moderate support in both groups. 
Therefore, being “in” on plant and company 
information is also a key area in fulfillment 
of needs of personnel in lower management. 
and a moderately crucial area for middle 
management. 


SUMMARY AND CONCLUSIONS 


This study investigated perceptions of bot- 
tom- and middle-management jobs. Specif- 
ically studied were perceptions of need ful- 
fillment deficiencies and need importance 
Five need areas were selected for investiga- 
tion because of their relevance to the concept 
of a hierarchy of prepotency of needs, and 
their relevance to management positions. The 
five need areas were: security, social, esteem, 
autonomy, and self-actualization. The percep- 
tions regarding these needs were obtained by 
the administration of a questionnaire to 64 
bottom-management and 75 middle-manage- 
ment individuals in three separate industrial 
organizations. 

The following are the major conclusions 

1. The vertical location of management po 
sitions appears to be an important variable in 
determining the extent to which psychological! 
needs are fulfilled. 

2. The greatest differences in the frequency 
of need-fulfillment deficiencies between bot- 
tom- and middle-management positions occur 
in the esteem, security, and autonomy need 
areas. These needs are significantly more 
often satisfied in middle than in bottom man- 
agement. 

3. Higher-order psychological needs are 
relatively the least satisfied needs in both 
bottom and middle management. 
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4. Self-actualization and security are seen 
as more important areas of need satisfaction 
than the areas of social, esteem, and au- 
tonomy, by individuals in both bottom- and 
middle-management positions. 

5. The highest-order need of self-actualiza- 
tion is the most critical need area of those 
studied, in terms of both perceived deficiency 
in fulfillment and perceived importance to the 
individual, in both bottom and middle man- 
agement. This need is not perceived as sig- 
nificantly more satisfied at the middle-man- 
agement level than at the bottom-manage- 
ment level. 
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THE EFFECT OF EXPERIENCE IN TELECOURSES ON 
ATTITUDES TOWARD INSTRUCTION BY TELEVISION 
AND IMPACT OF A CONTROVERSIAL TELEVISION 
PROGRAM 


RICHARD I. EVANS, BETTY A. WIELAND, ann CHARLES W. MOORE 


University of Houston 


Research in noncommercial educational tele- 
vision (ETV) has been concerned with a num- 
ber of problems but, in general terms, two di- 
rections of such research may be suggested. 
One seems primarily involved with television 
as a medium of instruction; the other, with 
a wide range of aspects of general noncom- 
mercial educational programing, from such 
problems as identification of the general ETV 
audience to studies of impact of specific ETV 
programs. 

The present study was designed to investi- 
gate problems stemming from both directions. 
More specifically, it dealt with the following 
general problems: 

1. What is the effect of a single program on 
viewer attitudes? Since, as suggested in an 
earlier study (Evans, 1957), the audience for 
educational programs is not likely to watch an 
entire series consistently but may often view 
only one program of a series, evaluation of 
the impact of any program in a series becomes 
important. A previous studv (Asher & Evans, 
1959) indicated that single programs of se- 
ries such as those produced by the National 
Educational Television and Radio Center do 
have a measurable effect on viewers. While 
this study demonstrated that attitude changes 
toward certain concepts in the program could 
be measured, it did not explore the effect on 
such change of already existing theoretically 
related attitudes. The present investigation at- 
tempted not only to study possible attitude 
changes that occurred as a result of viewing 
a program but also endeavored to examine 

1 Supported in part by a grant-in-aid awarded to 
the senior author by the National Educational Tele- 
vision and Radio Center, Ann Arbor, Michigan. The 
authors are indebted to psychology graduate students 
George Woodward and Montrose Wolf for their as- 
sistance in implementing this project 


such changes in terms of an already existing 
theoretically related attitude. 

2. What is the effect of previous experience 
in taking a telecourse on attitudes toward tele- 
vision as an instructional medium? Since the 
subjects in our present investigation were stu- 
dents who had been in an institution where 
instructional television had been utilized for 
several years, it seemed appropriate to ex- 
amine the effect of such experience on atti- 
tudes. A previous report by the senior author 
(1956) suggested that factors other than tele- 
vision itself might account for unfavorable 
attitudes toward television. For example, dis- 
like of the subject matter of the telecourse. 
the instructor, or the grade received could re- 
sult in a displacement of feeling which would 
be expressed as an unfavorable attitude to- 
ward television. In effect, it is possible that 
the medium is blamed for disturbances stem- 
ming from attitudes toward other factors in 
the telecourse situation. In the present study, 
attitudes toward television as a function of 
experiences with telecourses were examined 
in terms of other possible contributing factors. 


METHOD 
Subjects 


The subjects were 160 students enrolled in five 
sections of introductory psychology at the Univer- 
sity of Houston. For the attitude change section of 
the study, one section (nm = 33) was designated as the 
control group (C) and another (m= 24) as the ex- 
perimental group (E). The classes did not differ sig- 
nificantly in responses to the pretesting battery of 
tests as described below, so they could be considered 
adequately matched for purposes of the present in- 
vestigation. 


Experimental Program 


The television, program used in the present study, 
entitled Roots of Prejudice, is part of a series, People, 
produced by the National Educational Television 
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and Radio Center? for national distribution. This 
program, one-half hour in length, presents a class- 
room situation in which psychology instructors dis- 
cuss with students the development and character- 
istics of racial and religious prejudice. Psychology 
graduate student and faculty previewers agreed that 
two goals of the program were to challenge the va- 
lidity -of racial or religious stereotyping and to es- 
tablish the need for social tolerance. The program 
can clearly be described as controversial on the basis 
of its potential emotional impact on the social climate 
of Houston, a large, eastern-Texas community: at 
present, Houston has the largest segregated school 
system in the United States. 


Psychological Measuring Instruments 


A group of four social psychological measurement 
devices was used. The first of these, the Evans Quasi- 
Role-Playing Device, was developed for an earlier 
study by the senior author (1952). It is a semi 
projective device which yields both quantitative and 
qualitative indices of the intensity of the respond- 
ent’s prejudice toward social objects. The respondent 
is asked to imagine or “cognitively role play” him- 
self as being either prejudiced or unprejudiced and 
to give as many reasons as he can for being preju- 
diced or unprejudiced. In the present study, the sub- 
jects were asked to “role play” an individual favor- 
able to and one hostile to the Supreme Court de- 
segregation decision, listing as many arguments as 
they could in favor of and in opposition to the de- 
cision to desegregate the schools 

The second test used was a modification of the 
widely used Osgood Semantic Differential (Osgood, 
1957) designed to measure several dimensions of atti- 
tudes. Eleven scales were used in various combina- 
tions, good-bad, pleasant-unpleasant, safe-dangerous, 
valuable-worthless, fair-unfair, complete-incomplete, 
clear-hazy, high-low, clean-dirty, weak-strong, and 
active-passive. With each concept, the good-bad and 
pleasant-unpleasant scales were used in conjunction 
with two other evaluative scales, one potency scale 
(weak-strong), and one activity scale (active-pas 
that each concept was measured by six 
There were two groups of concepts toward 
which attitudes were measured. One group of con 
cepts used was related directly to ideas presented in 
the program and included racial discrimination, inte- 
gration of the races, Negroes, white people, Blake 
(an Anglo-Saxon stereotype), and Goldberg (a con 
trasting name stereotype and the name of one of the 
participants in the program). The other concepts 
used were of a more peripheral nature, psychologist, 
classroom lecture, and television course. Scoring con 
sisted of summing the four evaluative-measure score 
of a subject on a specific concept. Thus, each sub- 
ject was given one evaluative score for each of the 
concepts on which he expressed himself. 
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scales 
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The third test used in the battery was the Ethno- 
centrism Scale (Adorno, Frenkel-Brunswik, Levinson, 
& Sanford, 1950). The Ethnocentrism Scale purports 
to measure the degree of general intolerance of racial 
and religious out-groups. A 10-item form was used. 

The fourth measure used, a personal data sheet, 
included another application of the Osgood Semantic 
Differential (Evaluative Dimension). In this instance, 
the good-bad scale was ustd to evaluate the attitudes 
toward concepts related to methods of course pres- 
entation of those students who had completed tele- 
courses at the university. Information was also ob 
tained from the respondent concerning sex, major 
field in college, age, class, grade point average, and 
previous experience with telecourses 


Procedure 


Pretesting of both groups was carried out during 
a regular class meeting. Following the administration 
of the pretests and during the next regularly sched- 
uled meeting two days later, the experimental group 
was shown the experimental television program, 
Roots of Prejudice, by closed circuit television in 
specially designed viewing rooms at the university. 
Posttesting was effected at the same session for the 
experimental group and at a comparable time for 
the control group. 

The experimental procedures were instituted in 
such a as to provide means for investigating 
the two problems: the extent to which a single view- 
ing of the program would affect viewers’ attitudes, 
and the extent to which students’ experience with 
telecourses would affect their attitudes toward tele- 
vision instruction. 

With reference to the first problem, the procedure 
was to compare pre- and posttest scores of the ex 
perimental and control groups. In the case of the 
other problem, the effect of the course-taking ex 
periencé with educational television on attitudes to- 
ward that medium, the analysis of data involved 
some of the responses on the personal data sheet and 


way 


general This provided for 
each subject a sum of four scores on the evaluative 
dimension for the topics, classroom lecture and tele- 
vision lecture, and provided information pertaining 
to the student’s experience (or lack of it) with edu 
cational television, his success in courses taken by 
television, and his evaluation of such courses (a 
good-bad Osgood scale). To investigate the influence 
of experience, the experienced and nonexperienced 
groups were compared in terms of their Osgood 


Osgood scale responses 


scores, 
RESULTS 

Effectiveness of the Program 

The control and experimental groups were 
compared by means of Wilson’s distribution- 
free test of analysis of variance hypothe- 
ses (Wilson, 1956) in terms of their pretest 
scores to determine whether or not they were 
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Telecourses and Attitudes toward Television 


matched groups in terms of Osgood scale re- 
sponses with respect to the nine concepts. No 
significant differences were found. Therefore, 
in order to determine the effectiveness of the 
film and to eliminate the possibility of “re- 
sponse set” of the pretest itself, comparisons 
of C and E on the posttests were made using 
the same statistical test. Table 1 reports these 
data. It indicates the concepts examined and 
the chi square value for the posttest differ- 
ences on each of three Osgood factors, evalua- 
tive, potency, and activity. As is apparent 
with one degree of freedom and a chi square 
value of 3.84 necessary for significance at 
even the 5% level of confidence, none of the 
differences even approaches significance. 

Pre- and posttest responses in E on the 
Evans Quasi-Role-Playing Device were com- 
pared with respect to the number of “dislike” 
and “like” responses made with respect to the 
Supreme Court desegration decision. In both 
the pre- and posttests a 51° to 49° break- 
down of the proportion of dislike to like re- 
sponses was present, indicating, of course, in 
a quantitative sense no apparent effect of 
the film on expressed rationalizations for or 
against the Supreme Court decision. 

An analysis of the Evans test responses in 
qualitative terms by virtue of an examination 
of the content of the pre- and posttest reasons 
given for accepting or rejecting the Supreme 
Court segregation decision also indicated no 
significant impact of the experimental films. 
This analysis, then, suggested that the tele- 
vision program had no measurable effect on 
the attitudes of E. 

To explore more thoroughly the possibility 
of some impact of the program as a function 
of a theoretically related attitude, ethnocen- 
trism, another procedure was introduced. This 
involved the comparison of the extremely 
prejudiced (above the 75th percentile) and 
unprejudiced (below the 25th percentile) 
groups as measured by E-Scale responses. For 
each of these groups, medians were computed 
ior pretest and posttest scores on the Os- 
good. If related attitudes operate as influences 
in such situations, individuals with extreme 
scores on the ethnocentrism measure would 
tend to be affected differently by the film. 
The pre- and posttest medians on the nine 
topics of the Osgood were compared by ¢ tests. 


TABLE 1 


Cut SQuARE VALUEs (1 df) 
PosTTEST RESPONSES ON OsGoop ScALes or C 
(nm = 33) 


OF COMPARISONS BETWEEN 


AND FE. (m = 24 
scales 


Concept ativ otency Activit 
Psychologists 

Racial prejudice 

Classroom lecture 

Negroes 

The name Blake 

Integration of races 
Television instructio 
White people 

The name Goldberg 


Note Chi square equal t 


ignificance at the § 


No significant differences were found for the 
high prejudice group. For the low prejudice 
group, there was a shift in medians on both 
Blake and Goldberg on the evaluative factor 
toward a less favorable or valued position. 
The former difference was significant at the 
.O1 level of confidence, the latter at the .02 
level. To rule out the possibility that two sig- 
nificant ¢ tests out of a group of 18 computed 
could be due to chance, an examination of a 
table for the test of significance for a series 
of statistical tests (Sakoda, Cohen, & Beall, 
1954) was made. This revealed significance 
of these findings approaching the 1°% level of 
confidence. 


To determine the extent to which experi- 
ence with television as a medium of instruc- 


tion was related to students’ attitudes to- 
ward that medium, the total » of 160 was 
administered the personal data sheet and was 
divided into two groups, the television-experi- 
enced group (m= 45), those who had en- 
rolled in and earned grades in one or more 
telecourses, and the television-nonexperienced 
group (m = 115), those who had not been en- 
rolled in telecourses. These two groups were 
compared on the Osgood concepts, classroom 
lecture, and television lecture. The ¢ tests of 
the differences between medians for the two 
groups were not significant on either of these 
concepts. 
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A t test of the median values placed on the 
concepts, classroom lecture, and television lec- 
ture, by the TV-experienced group yielded a 
difference significant at the .01 level of con- 
fidence. This difference favored the classroom 
lecture as a medium of instruction. A likely 
inference from this result, of course, is that a 
less favorable attitude toward television in- 
struction resulted from experience in taking 
telecourses. It still seemed possible, however, 
that factors other than experience, per se, 
might have contributed to this result. There- 
fore, a phi coefficient was computed between 
attitudes toward television lecture and class- 
room lecture involving the total m of 160 to 
vield some insight concerning the amount of 
variance accounted for by this relationship. 
The obtained coefficient of .29, although sig- 
nificant at the .05 level of confidence, clearly 
suggested that, indeed, a great deal of the 
variance was unaccounted for. Therefore, a 
phi coefficient between! attitude toward tele- 
vision lecture and grades earned in the tele- 
course was computed for the TV-experienced 
group. This computation yielded a value of 
.36, significant at the .01 level. It appears, 
therefore, that although negative attitudes to- 
ward television as a medium of instruction 
may partly result from experience per se with 
such instruction, poor grades in the course 
may not be ruled out as a possible factor con- 
tributing to the formation of negative atti- 
tudes. 


DIscUSSION 


Certain tendencies in the present study 
might be further examined. The fact that Os- 
good scale scores demonstrated that so-called 
tolerant subjects, as measured by the E Scale, 
became significantly even less likely to stereo- 
type names like Blake and Goldberg and re- 
gressed toward the mean with respect to ex- 
pressed attitudes, while the intolerant group 
revealed no signi/i\cant change, is an interest- 
ing finding. Thi: would tend to support the 
widely held theory that individuals who hold 
strong attitudes (in this case extreme Osgood 
scale responses to the names Blake and Gold- 
berg) are relatively uninfluenced by communi- 
cation messages which challenge these strong 
attitudes, while individuals holding less in- 
tense attitudes (in this case Osgood responses 


which suggest only a slight tendency to stereo- 
type) are more vulnerable to such communi- 
cation messages. 

With respect to the question of the attitude 
toward television as an instructional medium, 
there is evidence present in support of the 
proposition that “out of context” studies of 
instructional television may lead to invalid 
conclusions. The demonstration in the pres- 
ent study of the influence of grades received 
in telecourses gives some support to this pos- 
sibility. The contention proposed here is that, 
if all related variables (e.g., attitude toward 
the instructor, the subject matter itself, gen- 
eral motivation arousal characteristics of the 
course) could be held constant, television as 
a teaching medium would take its place along 
side other educational media; that is, they 
are all equally well received in proportion to 
the extent that effective learning principles 
are implemented in their use. 


SUMMARY AND CONCLUSIONS 


The present study was designed to investi- 
gate two questions: 

1. To what extent would a single contro- 
versial educational television program change 
viewers’ attitudes? 

2. How would previous experience with tele- 
courses affect students’ attitudes toward tele- 
vision instruction as contrasted with tradi- 
tional classroom instruction? 

Introductory psychology students, compris- 
ing a control and an experimental group. were 
administered a group of psychological instru- 
ments consisting of the Ethnocentrism Scale, 
the Evans Quasi-Role-Playing Device, and 
nine concepts using the Osgood Semantic Dif- 
ferential (Evaluative Dimension). A personal 
data sheet elicited information concerning 
subjects’ experience with and evaluation of 
telecourses. Following pretesting, the experi- 
mental group viewed by closed circuit tele- 
vision the program, Roots of Prejudice. Post- 
testing procedures involved the administra- 
tion of the Evans test and the Semantic Dif- 
ferential to both groups. 

Analyses of the data lead to the following 
conclusions: 

1. The educational television program, 
Roots of Prejudice, succeeded, at least as 
measured by immediate posttest attitudes. in 
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causing relatively nonethnocentric viewers to 
engage in less intense name stereotyping, did 
not alter attitudes of ethnocentric viewers, 
and, in brief, fell short of the apparent goals 
of the program. 

2. Negative attitudes toward television in- 
struction may be less the result of experience 
in taking telecourses than of such factors as 
poor course performance. In other words, tele- 
vision as a medium of instruction may be- 
come an available “whipping post,” because 
of its novelty, for latent hostile attitudes aris- 
ing from other factors in the college course 
situation. 
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SELF-DESCRIPTION AS A PREDICTOR OF SUCCESS 
OR FAILURE IN FOREIGN SERVICE 
CLERICAL JOBS 


REGIS H. WALTHER 


Department of State 


Cronbach (1960), Bogard (1960), and Ghi- 
selli and Barthol (1953) have reported stud- 
ies showing a relationship between self-report 
questionnaires and job success. The present 
study was designed to determine whether re- 
sponses given by Foreign Service clerical per- 
sonnel to questions about attitudes, interests, 
activities, and family background could be 
used to predict success or failure in various 
clerical jobs in the Foreign Service and thus 
improve the quality of performance and re- 
duce turnover. 


METHOD 


Starting in the spring of 1954 at the recommenda- 
tion of the Civil Service Commission, the Depart- 
ment of State gave all new Foreign Service clerical 
employees the day after they entered on duty a 68- 
item multiple choice questionnaire including ques- 
tions relating to grades and interests in school, job 
characteristics most liked or disliked, relationship 
with parents, social activities, steadiness of employ- 
ment, hobbies or outside interests, etc. Sample ques- 
tions are: 


Do you entertain groups at home: 
a. Frequently 
b. Occasionally 
c. Almost never 
Have you: 
a. Often been double crossed by people 
b. Sometimes been double crossed by people 
c. Been double crossed once or twice by people 
d. Never been double crossed by people 
Were your parents: 
a. Always very strict with you 
b. Usually very strict with you 
c. Seldom very strict with you 
d. Never very strict with you 


Which of the following characteristics of a job is 
least important to you: 
a. Opportunity to ask questions and consult about 
difficulties 
. Opportunity to understand just how one’s su- 
perior expects work to be done 
Certainty one’s work will be judged by fair 
standards 


d. Freedom in working out one’s own method of 
doing the work 

e. Co-workers—Congenial, 
quate in number. 


competent, and ade- 


The responses given to these questions have been 
used only for research purposes. The experimental 
group (NV = 1183) consisted of all clerical emplovees 
who had completed the questionnaire prior to De- 
cember 1, 1956. For each of these employees it was 
determined (a) if he still was employed as of Au- 
gust 30, 1959; (b) if he had resigned or been sepa- 
rated, how long he had served before leaving; (c) 
the reason given for his resignation or separation 
(poor performance, dissatisfaction, marriage, or other 
reasons); (d) the level of his performance (above 
average, or below average). The determi- 
nation regarding level of performance was based on 
the average of the summary numerical ratings con- 
tained in the employee’s personnel file. 

The data was first processed by dividing the sub- 
jects into job categories and then by means of an in- 
formal item analysis determining which items seemed 
to predict success or failure for the various job cate- 
gories. It was found that there were substantial dif- 
ferences among the employees assigned to the major 
job categories (secretaries, mail and records clerks, 
code clerks, pouch clerks, and accounting clerks). It 
was also found that the same items which seemed to 
predict tenure also appeared to predict quality of 
work 

The next step was to attempt to find the reasons 
why particular responses seemed to be related to suc- 
cess or failure in the different clerical jobs. A series 
of group meetings attended by individuals who had 
either performed or supervised one of the jobs being 
studied was held to get information about the satis- 
faction contained in the work and the qualities pos- 
sessed by the individuals who do well as opposed to 
those who do poorly. A series of hypotheses was de- 
veloped based both on pertinent statistical informa- 
tion as well as the information developed from the 
group meetings and the judgments of experienced 
personnel officers. These hypotheses related to the be- 
havioral requirements for success in the jobs being 
studied and the degree to which they were important 
for the different jobs. Questionnaire responses were 
then grouped together on the basis of four criteria: 
(a) some positive intercorrelation, not necessarily 
statistically significant, (b) logical relationship to th: 
hypotheses, (c) ability of the item to distinguish be 
tween above average and below average performers 
in the job category in which the hypothesized be- 
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TABLE 1 


'ESTIONNAIRE RESULTS BY JOB CATEGORY 


Experimental Group 


J ob ¢ ‘ategory 


I. Secretaries 
A. Still Employed 
Above average 
Average 
Below average 
Doing other work 
Potal 


B. Resigned or Separated 
Poor performance 
Dissatisfaction 
Other reasons 
Marriage 
Total (excluding marriage) 


Total (including marriage) 


Il. Code Clerks 
A. Still Employed 
Above average 
Average 
Below average 


Potal 


B. Resigned or Separated 
Poor performance 
Other 
Total 
Ifl. Mail and Records Clerks 
A. Still Employed 
Above average 
Average 
Below average 
Potal 


B. Resigned or Separated 
Poor performance 
Other 
Marriage 

Total 


havioral requirement appeared to be most important, 
(d) ability of the item to distinguish above average 
employees in the job category presumably most in- 
fluenced by the characteristic from above average 
employees in the job categories least influenced by 
the characteristic. For example, it was hypothesized 
that a high degree of social isolation was a negative 
consideration for secretaries and that high degree of 


sociability was a negative consideration for code 


Mean 


‘ross-Validation Group 


SD Mean SD 


clerks. Therefore 
lated to degree of isolation, 
guished above average from below 
taries, and which also distinguished 
average secretaries and clerks 
together. The judgments were made by the write: 
From this process there emerged clusters 
items, each cluster suggesting a particular behavioral 
style or capability 


logically seemed re 


which 


items which 


social distin- 
average 
between 
were all placed 


secre- 
above 
code 


of 


seven 
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The next step was to select 20 high performers and 
20 low performers for each of the job categories. The 
high group consisted of employees who had stayed 
with the job for at least 3 years and consistently 
had received a superior rating. The low groups con- 
sisted of employees whose services clearly were not 
satisfactory. Weights were given to the items so as 
to create the maximum numerical spread between the 
high and low groups and between the high groups 
ior different occupational categories. Weights were 
also given to each of the seven clusters for each of 
the occupational categories, thus developing scoring 
keys for each of the seven clusters of items and for 
the five job categories. The weighted scoring keys 
were then applied to the experimental group and 
adjustments made in weights for the individual items 
and for the cluster in order to improve the effective- 
ness of the scoring keys. The results are contained in 
Table 1. 

The cross-validation group was composed of all 
Foreign Service clerical employees hired between De- 
cember 1, 1956 and March 30, 1957 for secretaries, 
and between December 1, 1956 and December 30, 
1957 for all other clerical employees (excluding those 
who had transferred to Civil Service), plus any cleri- 
cal employee hired after these dates whose services 
clearly had been unsatisfactory. The results of this 
part of the study are also contained in Table 1. 

For the most part the duties of the five jobs can 
be inferred from the titles. The secretary performs 
typical secretarial duties except that there is a more 
frequent change in bosses due to the fact that per- 
sonnel are transferred periodically from post to post. 
The mail and records clerk classifies and files incom- 
ing documents and searches for particular documents 
upon request. The code clerk follows a set procedure 
for enciphering and deciphering telegrams. He must 
have some mechanical aptitudes and be able to work 
alone without direct supervision. For security reasons 
he works. in a confined area behind a locked door 


and the work does not involve much social inter- 
course during the workday. He either works on shifts 
or must be available for call during the off hours. 
The pouch clerk performs clerical and messenger 
work related to receiving and dispatching all mail 
pouches transmitting mail between the Department 
of State and Foreign Service posts. The accounting 
clerk performs various record-keeping functions re- 
lated to the disbursement of funds, and the obliga- 
tion and liquidation of accounts at Foreign Service 
posts. 

The criteria used to measure success were perform- 
ance ratings and tenure on the job. However, only 
the performance criteria was used for developing the 
scoring keys. Because most personnel decisions are 
made in Washington, the Department of State con- 
siders it essential that the written record for the em- 
ployee be adequate, and therefore a great deal of 
effort is expended to make the ratings as useful as 
possible. Each individual included in the study has 
been rated at least three times by a minimum of two 
raters, plus at least one rating by a Foreign Service 
inspector. During 1955, 124 Foreign Service employees 
were given a special rating with the raters assured 
that the rating would be used solely for research 
purposes. Three evaluations were obtained for each 
employee, one from the supervisor, one from the ad- 
ministrative officer, and one from a co-worker. The 
average of these ratings was found to correlate .77 
with the ratings contained in the employee’s per- 
sonnel file. 

Accurate records were available regarding the length 
of service of all the employees included in the study. 
Each resigning employee is required to send a letter 
giving his reasons, and in about half the cases exit 
interviews are conducted. Employees included in the 
experimental or cross-validation group who had been 
separated or had resigned were divided into four 
categories: Poor performance, Dissatisfaction, Mar- 
riage, and Other reasons. If a review of the em- 


TABLE 2 


COMPARISON OF ABOVE AVERAGE EMPLOYEES 


Experimental Group 


SD 


Job Category N Mean 


Secretarial Key 

a. Secretaries 

b. Other clerical employees 
Code Clerk Key 

a. Code clerks 

b. Other clerical employees 
Mail and Records Clerk Key 


a. Mail and Records clerk 
Other clerical employees 


* Significant at the .01 level or better 


Cross-Validation Group 


Mean SD t 
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TABLE 3 


COMPARISON OF ABOVE AND BELOW AVERAGE EMPLOYEES ON INDIVIDUAL ELEMENTS 


Above Avg. 


SD 


78) 
1.7 
3.0 
2.0 
1.9 


Mean Mean 


Job Category 


A. Secretaries (N 
—1.3 
—.1 
1.0 


Please Authority 
Social Isolation 

Social Leadership 
Reaction to Aggression 
Accept Standards 
Accept Routines 

Use of Intellect 


to 


B. Code Clerks 
Please Authority 
Social Isolation 


= 


Social Leadership 
Reaction to Aggression 
Accept Standards 
Accept Routines 

Use of Intellect 


* Significant at the .05 level. 
** Significant at the .01 level or bette 


ployee’s file indicated a poor performance record, he 
was placed in the Poor performance category re- 
gardless of the reason for the separation. Included in 
the “Other reasons” category were individuals who 
probably would have been placed in the “Poor per- 
formance”’ or “Dissatisfaction” categories if more in- 
formation had been available about their perform- 
ance or real reason for leaving. 


RESULTS 


By far the best results came from the secre- 
tarial groups. When the secretarial key was 
applied to the cross-validation group it was 
possible to predict performance with a prod- 
uct-moment correlation of .60 and turnover 
excluding marriage with a tetrachoric corre- 
lation of .49. The scores on the individual 
elements were surprisingly consistent between 
the experimental and cross-validation group. 
The predictions for the code clerks developed 
a correlation of .38 for performance and .30 
for turnover. For the mail and records clerks 
it was .28 for performance and .36 for turn- 
over. The results for the pouch clerks were 
disappointing. It is believed that the reason 
for this is that pouch clerks are motivated 
primarily by nonjob factors. 


Experimental Group 


Below Avg. 


SD 


(N = 45) 


1 
4 
1 
1 


wr 


Cross-Validation Group 


Above Avg. Below Avg 


Mean SD Mean SD 


(N = 14) 
—16 
4 
& 

3 
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Table 2 gives the results of comparing the 
scores of above average secretaries, code 
clerks, and mail and records clerks. For secre- 
taries the differences are significant at better 
than the .01 level for both the experimental 
and cross-validation groups. Table 3 shows 
the comparison of above average and below 
average employees on individual elements. 
For the secretaries the difference between the 
two groups was significant at better than the 
.01 for three of the elements. 


DISCUSSION 


The seven clusters of items suggested dii- 
ferent behavioral styles and capabilities which 
have an important influence on work. Three 
of the clusters (Please Authority, Accept Rou- 
tines, and Accept Standards) seemed to relate 
to the way the individual likes to get his guide 
lines or instructions for doing his work— 
whether he responds primarily to the wishes 
of authority, the orders of authority, or the 
standards of authority. The next three ways 
(degree of Social Isolation, Social Leadership, 
and Reaction to Aggression) seemed to relate 
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to the way the individual deals with people. 
The last cluster (Use of Intellect) seemed to 
relate to the way the individual wants to use 
his mental capacities. 

The following assumptions have been de- 
veloped regarding the specific meaning of 
these elements and the effect that a high or a 
low score has on work: 

Please Authority. This element relates to 
the degree to which the individual tries to 
please persons in authority. It involves more 
than obedience, as it includes personal satis- 
faction in getting approval from the boss. An 
individual who scores high on this factor 
seems to prefer to work closely with a super- 
visor, is sensitive to his wishes, and often 
knows what the boss wants or what he will 
do, without being told. A person who has a 
negative score probably is not able or willing 
to anticipate the wishes of the boss. He is 
likely to prefer work involving a minimum 
of personal supervision. Examples of posi- 
tive items defining the element are: excellent 
grades in high school and not being charged 
with speeding during the last 5 years. Exam- 
ples of negative items are: frequently arguing 
with parents during adolescence on political, 
religious, and other issues and considering the 
opportunity to understand just how one’s su- 
perior expects work to be done less important 
than the other choices. 

Acceptance of Routines. This element re- 
lates to the degree to which an individual ac- 
cepts imposed routine and is content to fol- 
low it. A person with a high rating seems to 
prefer specific, concrete activities rather than 
ones involving variety and change, and to 
want to work under specific instructions. The 
instructions can be communicated either per- 
sonally or impersonally. A negative score 
seems to indicate a desire for variety and 
change and an impatience with routine. Ex- 
amples of positive items are: disliking inter- 
ruptions more than monotony and having 
worked at the same job all his life. Examples 
of negative items are: parent never very strict 
and considering steadiness and permanence of 
work less important than other choices. 

Acceptance of Standards, This element re- 
lates to the degree to which the individual 
tries to comply with standards. A person with 
a high score seems to want to do things well 


Walther 


and is able to work within set tolerance or 
standards. A person with a negative score 
may not do too well with work which requires 
unusual accuracy. Examples of positive items 
are: rarely or never disagreeing with parents 
during adolescence on political, religious, so- 
cial, or other issues, and always meeting finan- 
cial obligations promptly. Negative items are: 
getting fair or poor grades in school, and par- 
ents always or never very strict. 

Social Isolation. This element relates to the 
degree to which the individual withdraws from 
people and avoids social relationships. A per- 
son with a high rating seems to prefer to work 
with things or data rather than people and not 
to object to working in isolation. A negative 
rating seems to indicate a desire to work with 
people and a dislike of isolation. Examples of 
positive items are: attends parties or social 
affairs several times a year or almost never, 
and has only a few friends. Negative items 
are: entertains groups at home frequently and 
likes to play bridge. 

Reaction to Aggression. This element re- 
lates to the degree to which the individual 
takes direct action when he thinks his rights 
are being violated. A person with a high score 
responds to what he perceives as aggressive 
behavior on the part of other people with ag- 
gressive behavior of his own. A negative rat- 
ing seems to indicate a tendency to placate, 
avoid, divert, or ignore aggressive behavior. 
Examples of positive items are: never has 
difficulty getting rid of salesman and com- 
plains to waiter whenever complaint is justi- 
fied. Negative items are: never been double 
crossed by people and never breaks into con- 
versation. 

Social Leadership. This element relates to 
the degree to which the individual takes the 
lead in social activities. A person with a high 
score seems to prefer activities involving the 
influencing of others through social type re- 
lationship. A negative score may indicate a 
preference for activities involving data or 
things rather than people. Examples of posi- 
tive items are: when in school was a member 
of several clubs and organizations and has 
held positions as an officer in an organization 
during the last 5 years. Negative items are: 
almost never entertains groups at home and 
never has held a position in an organization. 
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Use of Intellect. This element relates to the 
degree to which an individual wants to use 
intellect when dealing with problems of work. 
The higher the score the more likely the in- 
dividual will be interested in planning, using 
judgment, and solving problems. The lower 
the score the more desire for routine, repeti- 
tive, short cycle activities. It seems likely that 
each job has a range for this factor. Examples 
of positive elements are: likes to play chess 
and stood in the top of 10°% of his college 
class. Negative items are: reads newspapers 
once a week or less and had no hobbies as a 
child. 

With respect to the specific occupational 
categories it appears that a successful ‘secre- 
tary likes to please the boss and is responsive 
to his requests, accepts standards, and is in- 
clined to take the lead in social relationships. 
On the other hand any significant degree of 
social isolation, resistance to or rejection of 
authority, or rejection of routines or bounda- 
ries are predictors of failure. She is likely to 
with aggression from others in other 
ways than fighting back. If she scores above 
a certain point on the Social Leadership or 
Use of Intellect scale she is likely to transfer 
to other work or to resign. 

The successful code clerk seems to be less 
sociable, shows less social leadership, and is 
more willing to accept routines than the poor 
clerk. There general agreement 
among officers experienced with either super- 
vising or assigning code clerks that they pre- 
fer to be supervised primarily by regulations 
rather than people. 

It was the opinion of experienced officers 
that the mail and records clerk should be self- 
reliant, be able to make decisions for herself, 
be able to negotiate successfully with other 
offices on procedural problems relating to rec- 
ords work, be able to work under pressure 
and budget her time, and work at a wide va- 
riety of tasks recognizing the relative impor- 
tance of each. None of the elements identified 
through this study méeasured these charac- 
teristics, perhaps because the items in the 


deal 


code was 


questionnaire were not sufficiently compre- 
hensive. There was a slight suggestion from 
the data that the mail and records clerks were 
more aggressive than the other clerks with re- 
spect to getting other people to do what they 
ought ‘to do. It should be noted that the mail 
and records group has relatively low prestige 
and many individuals assigned to this work 
consider that they have been sent to ‘Siberia.’ 
Therefore, it is likely that many individuals 
are influenced significantly by the social mean- 
ing of the work rather than the satisfaction 
they personally get from it. 

No statistical report is being made on ac- 
counting clerks because not enough subjects 
were included in the cross-validation group to 
permit the predictions to be tested adequately. 
The above average accounting clerks included 
within the experimental group scored high on 
the acceptance of standards and had moderate 
scores on the other elements. 


SUMMARY 


In this paper it has been reported that a 
questionnaire designed to do something else 
has proved to be reasonably successful at pre- 
dicting success or failure in various Foreign 
Service clerical occupations. Seven behavioral 
elements which relate to ways of responding 
to authority, ways for dealing with people, and 
level of use of intellect appeared to be sug- 
gested by the data. It seems likely that the 
effectiveness of the questionnaire can be im- 
proved by redesigning it specifically to meas- 
ure the behavioral style and capability ele- 
ments which have emerged from this study. 
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ABILITY TO FOLLOW INSTRUCTIONS AS A PREDICTOR 
if OF SUCCESS IN RECRUIT TRAINING’ 


FERDINAND STERN ? 


United States Naval Training Center, San Diego 


and LEONARD V. GORDON 


i United States Naval Personnel Research Field Activity, San Diego 

i Essentially, the purpose of recruit training fication Battery. The present study was un- 
i in the Navy is to effect an orderly and pro- dertaken to determine whether direction-fol- 
‘ gressive adjustment to military life and to lowing ability would also be predictive of the 
: provide, through information and practice, recruit’s success in meeting minimal stand- 
- basic military indoctrination which will con- ards in basic military practices. 

. tribute to successful service (USN Bureau of 

Naval Personnel, 1953). In this indoctrina- PROCEDURE 

: tion, the recruit obtains information on vari- The sample in the present study consisted of 563 


ous topics through lectures, films, and demon- 
strations. In addition, he receives training in 
such military practices as barracks care, cloth- 
ing care, personal cleanliness, and basic mili- 
tary drill. 

The degree to which the recruit acquires 
information is assessed through weekly writ- 
ten quizzes and an objective examination 
given at the end of recruit training, and is 
highly predictable by certain aptitude meas- 
ures in the Navy Classification Battery. His 
progress in acquiring efficiency in the funda- 
mental military practices is subjectively evalu- 
ated on a continuing basis by his company 
commander, and is predicted with only very 
moderate success by Navy Classification Bat- 
tery scores. Since early identification of indi- 
viduals who will fail to meet requirements in 
military practices is desirable, some relevant 
predictive factors other than those presently 
measured by the Navy Classification Battery 
must be found. 

Gordon and Alf (in press) found that, for 
a sample of school-eligible recruits, ability to 
follow directions in marking a Navy interest 
inventory was significantly related to the re- 
cruit’s acquisition of information during re- 
cruit training and was independent of verbal 
intelligence as measured by the Navy Classi- 

1 Opinions expressed herein are those of the authors 
and do not necessarily reflect those of the Depart- 
ment of the Navy. 

Now at Central State Hospital, Waupun, Wis- 
consin. 


recruits, 511 of whom had graduated from recruit 
training, and 52 of whom had been discharged for 
reasons of inaptitude prior to completion of the 
training period. Individuals discharged for medical 
or disciplinary reasons were not included in the 
sample. 

All recruits had been given the Navy Classification 
Battery during the morning of their third day in the 
service. This battery contains four tests: (a) the 
General Classification Test (GCT) consisting of 
analogy and sentence completion items; (b) the 
Arithmetic Test (ARI) containing computational 
and arithmetic reasoning items; (c) the Mechanical 
Test (MECH) consisting of tool and electrical knowl- 
edge, and mechanical comprehension items; and (d) 
the Clerical Test (CLER), a-test of number-match- 
ing ability. 

On the morning of the third day, prior to the ad- 
ministration of the Navy Classification Battery, a 
test of ability to follow directions was experimen- 
tally administered. The test, the Oral Direction Test 
(ODT) (Langmuir, 1952), provides a measure of the 
functional efficiency of the individual in understand- 
ing what he is told to do. It consists of a series of 
instructions presented orally by means of a recording 
The examinee is to carry out each instruction on his 
answer sheet. 

Two criteria were used in the present study. The 
first was the dichotomous criterion of inaptitude dis- 
charge-graduation from recruit training. Inaptitude 
discharges were given to recruits unable to achieve 
minimum standards in military practices such as 
marching, physical exercise, personal cleanliness, and 
barracks care, and whose conduct created various 
disciplinary problems. The second criterion was per- 
formance on the Recruit Final Achievement Test. 
This objective examination is administered at the 
end of recruit training to measure information ac- 
quired during the course of training. 

Biserial correlation coefficients were obtained be- 
tween all predictor variables and the discharge- 
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graduation criterion. Product-moment correlations 
were obtained between all predictors and the Re- 
cruit Final Achievement Test scores for graduating 
recruits. 


RESULTS 


Table 1 presents means, standard deviations, 
and validities of the Oral Directions Test and 
Basic Test Battery variables against the dis- 
charge-graduation criterion. All validity co- 
efficients are significantly different from zero 
well beyond the 1% level of confidence. The 
Oral Directions Test has the highest biserial 
correlation with the discharge-graduation cri- 
terion. 

Table 2 presents validities of all predictor 
variables against the Recruit Final Achieve- 
ment Test (RFAT) criterion for the graduat- 
ing recruits. All product-moment coefficients 
of correlations are significantly different from 
zero beyond the 1% level of confidence. For 
this group, and with this criterion, the Gen- 
eral Classification Test has the highest va- 
lidity. The validities of the Arithmetic Test, 
Mechanical Test, and Oral Direction Test are 
moderate and of about equal magnitude. 


DISCUSSION 


The results of the present study suggest 
that a measure of the ability to follow in- 
structions may be fairly highly predictive of 
one type of success in recruit training but not 
of another. In the initial part of recruit train- 
ing, much emphasis is placed upon the de- 
velopment of simple habits of cleanliness, 
neatness, obedience, and the like. Individuals 
who are poor at understanding or following 
oral instructions are unlikely to grasp these 


TABLE 1 


MEANS, STANDARD DEVIATIONS, AND BIsERIAL Cor 
RELATIONS BETWEEN PREDICTOR VARIABLES AND 
DIscCHARGE-GRADUATION CRITERION 
(N = 563) 


tr 
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Variable 
GCT 
ARI 
MECH 
CLER 
ODT 


TABLE 2 
PropUCT—-MOMENT CORRELATIONS AMONG PREDICTOR 
VARIABLES AND THE RECRUIT FINAL ACHIEVEMENT 
TEsT 
(N = 511) 


ARI MECH CLER 


Variable ODT 
GCT 

ARI 

MECH 

CLER 

ODT 


fundamentals with sufficient rapidity to en- 
able them to meet service standards. 

On the other hand, the ability to follow in- 
structions, as measured by the Oral Directions 
Test, is only very moderately predictive of 
the individual’s ability to reproduce, on a 
final achievement test, facts learned during 
recruit training. It contributes nothing, for 
example, to verbal intelligence as measured by 
the Genera! Classification Test, the multiple 
correlation having the value of .58 which is 
identical with the validity of the General 
Classification Test alone. 

The General Classification Test, while only 
moderately predictive of the inaptitude dis- 
charge criterion, is quite successful in predict- 
ing Recruit Final Achievement Test perform- 
ance. The verbal content of the General Clas- 
sification Test, the Recruit Final Achievement 
Test, and the training media undoubtedly ac- 
count for the substantial relationship obtained. 

A comparison of the results of the present 
study with those reported by Gordon and Alf 
is of interest. In the latter study, a median cor- 
relation of .35 was obtained between number 
of errors made in marking an interest inven- 
tory and Recruit Final Achievement Test 
scores; in the present study a correlation of 
.36 was found between Oral Direction Test 
scores and the same criterion. The measures 
of direction-following ability used in the two 
studies differed in their obtained relationship 
to verbal intelligence. The number of errors 
made on the interest inventory was unrelated 
to scores on the General Classification Test, 
while the Oral Direction Test scores had a 
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significant relationship with this measure. 
This lack of relationship in the Gordon and 
Alf study may have been due to restriction in 
range resulting from their use of a school- 
eligible sample. 

The results of the present study indicate 


that a test of ability to follow instructions ap- , 


pears to have value in predicting a significant 
part of a particular criterion complex. Fur- 
ther exploration of the utility of such a test in 
the military services appears to be warranted. 
Whether such a test would have equivalent 


value in industrial settings would have to be 
situationally determined. 
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A PSYCHOLOGICAL INVESTIGATION OF A GROUP OF 
DEMOGRAPHIC, PERSONALITY, AND BEHAVIORAL 
VARIABLES AS THEY RELATE TO VIEWING 
EDUCATIONAL TELEVISION 


RICHARD I. EVANS 
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Recent studies by Asher and Evans (1959) 
and Evans, Wieland, and Moore (1961) 
showed evidence of changes in attitudes as a 
result of viewing a specific educational tele- 
vision program under controlled conditions 
with selected samples. The impact of educa- 
tional television (ETV) in general, however, 
is more difficult to evaluate. As previously 
suggested by the writer (1957), size of audi- 
ence is not a sufficient criterion. Another pos- 
sible criterion is the degree to which programs 
seen are discussed with others. Such a hori- 
zontal interaction pattern would indicate im- 
pact beyond the group of individuals actually 
watching ETV. 

Earlier studies of educational television au- 
diences (Adams, 1956; Crary, 1957; Evans, 
1957; Geiger & Sokol, 1957; Merrill, 1957; 
Schramm, 1957) have been principally con- 
cerned with analysis of demographic charac- 
teristics; only peripheral attention has been 
paid to personality and behavioral charac- 
teristics. The study previously cited (Evans, 
1957), however, suggested that activity pat- 
terns of frequent viewers of educational tele- 
vision might differ significantly from those of 
nonviewers. Frequent viewers often appeared 
to have a self-improvement orientation, mani- 
fested in such behavior as attending lectures 
and concerts, which was less noticeable in non- 


viewers. This observation suggests the possi- 
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bility of underlying personality differences be- 
tween the two groups. 

The present study was designed to explore 
some of the possible differences which were 
suggested by earlier research between frequent 
viewers and nonviewers of educational tele- 
vision. It was decided to examine more thor- 
oughly apparent differences in behavior of the 
two groups and to investigate the possibility 
of underlying personality differences. The re- 
search design also called for a further com- 
parison of the two groups with respect to cer- 
tain demographic characteristics. Specifically, 
the present study sought to answer the fol- 
lowing questions: 

1. Do frequent viewers and nonviewers of 
KUHT-TV (the University of Houston tele- 
vision station) differ with respect to occupa- 
tion, level, educational 
level, The study 
(Evans, indicated with 
respect to these characteristics. 


socioeconomic SeX, 


race, or age? 


1957) 


previous 
no difference 


2. How do the two groups compare with 
respect to certain measures of personality? 
(a) Are nonviewers of KUHT-TV more likely 
to be dogmatic than frequent viewers? Rok- 
(1954) has defined a dogmatic indi- 
vidual as one who is inclined to be intoler- 


each 


ant, rigid in his beliefs, opposed to new ideas, 
and closely allied to what he considers posi- 
tive authority. Because educational television 
may be considered a possible source of dis- 
turbing ideas, it was hypothesized that few 
dogmatic individuals would be frequent view- 
ers, since they would tend to avoid exposure 
to information which might threaten their be- 
lief systems. (6) Are nonviewers more likely 
to be authoritarian than frequent viewers? 
An authoritarian personality, as defined by 
Adorno, Frenkel-Brunswik, Levinson, and San- 
ford (1950), tends to be anti-intraceptive, 
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ethnocentric, conventional, politically con- 
servative, and responsive to external rather 
than internal authority. It was hypothesized 
that authoritarians would be less likely than 
equalitarians to watch educational television, 
since it might be thought of as endangering 
existing social practices. (c) Can the two 
groups be differentiated in terms of achieve- 
ment motivation? As defined by McClelland, 
Atkinson, Clarke, and Lowell (1953), need 
for achievement denotes an intrinsic, some- 
times compulsive, drive for accomplishment 
which dominates a large portion of the indi- 
vidual’s activity. Since viewing ETV might 
be one means of fulfilling a need for achieve- 
ment, it was hypothesized that frequent view- 
ers might have a higher need for achievement 
than nonviewers. 

3. How do the two groups compare with 
respect to certain types of behavior? (a) Do 
frequent viewers and nonviewers differ in vot- 
ing frequency? Results of the earlier study 
(Evans, 1957) pictured the frequent ETV 
viewer as civic-minded; he might, therefore, 
be expected to vote more regularly than a non- 
viewer. (4) Are there any significant differ- 
ences between the two groups in leisure time 
activity preferences? If frequent viewers do 
have a more marked self-improvement ori- 
entation than nonviewers, as suggested above, 
this might be reflected in their leisure time 
activities. (c) Do frequent viewers have more 
friends who watch and discuss programs on 
KUHT-TV than nonviewers? If so, what 
types of programs and what topics are most 
discussed, and is the information thus ob- 
tained considered valuable? 

4. Can the total impact of educational tele- 
vision on viewers and on the community be 
evaluated? Is there, for example, any evi- 
dence of a sociometric horizontal impact pat- 
tern associated with educational television 
viewing? 


SAMPLING 


The present sample was chosen from a 
larger sample used in a previous study (Evans, 
1957). The original sample consisted of 500 
television homes in the greater Houston area, 
randomly selected using a “grid density” pro- 
cedure developed by Houghland (1956), and 
subdivided into frequent viewers, infrequent 
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viewers, former viewers, and nonviewers of 
KUHT-TV. Since the original interviews were 
conducted in May 1956, and the present in- 
terviews between July 15 and August 15, 
1957, it was expected that some changes 
would have occurred in viewing frequency 
during the interval of a year. For the present 
study frequent viewers were defined as those 
who watched one ETV program per week, and 
nonviewers were defined as those who had not 
watched ETV within the last 6 months. Be- 
cause of the relatively small audience for 
ETV, these definitions were necessary in or- 
der to obtain an adequate sample for analy- 
sis. On the basis of these definitions, 100 fre- 
quent viewers and 100 nonviewers were se- 
lected from the original sample. Interviews 
were completed with 137 members of this 
group; the remainder could not be contacted. 
Because some of the interviewees selected as 
frequent viewers on the basis of the earlier 
study could no longer be so classified, final 
results were based on interviews with 42 fre- 
quent viewers and 72 nonviewers. Although 
the sample was necessarily limited, it was stil] 
reasonably representative of frequent viewers 
and nonviewers of ETV in the greater Hous- 
ton area. 


METHODOLOGY 


The questionnaire used in the present study in 
cluded fixed-alternative, general open-end, and pro- 
jective questions. 

As personality measures, modified versions of the 
Dogmatism (D) Scale of Rokeach (1954), the au- 
thoritarian F Scale of Adorno et al. (1950), and the 
Need for and Value of Achievement Scales (nAch 
and vAch) of McClelland et al. (1953) were used 
Shorter forms of these scales were developed in or- 
der to insure sufficiently high response motivation 
throughout the interview. Certain D- and F-Scale 
items were reworded to avoid fallacious responses 
that might result from acquiescent tendencies or 
other types of response set. This control measure 
was suggested by Christie, Havel, and Seidenberg 
(1958), who demonstrated the reversibility of F- 
Scale items and analyzed the acquiescent factor origi- 
nally involved in this measure of authoritarianism. 

Several open-end projective questions reported by 
Adorno et al. (1950) were used to gain added in- 
sight into authoritarianism. Responses to these ques- 
tions were coded in terms of the suggested cate- 
gories. Three judges estimated a cumulative index of 
authoritarianism for each respondent, based on F- 
Scale scores and coded responses to the projective 
questions. This index was used in the statistical 
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analysis. Similarly, a cumulative index of dogmatism 
for each individual was based on the several D-Scale 
scores. Achievement motivation level for each re- 
spondent was based on four open-end questions. Be- 
havioral information was sought by means of sev- 
eral fixed-alternative questions, plus general open- 
end questions regarding social interchange with 
friends on the subject of KUHT-TV programs. 

Six trained interviewers, including Baylor Univer- 
sity medical students and University of Houston psy- 
chology graduate students, interviewed respondents 
in 137 television homes during the period from July 
15 to August 15, 1957. Where possible, the interview 
material was precoded for IBM analysis. A group of 
project assistants served as content analyzers for the 
open-end projective questions. The usual reliability 
checks were made among raters. A final response 
code was established by the writer in conference with 
the assistants. Information from the source sheets 
was then punched on the IBM cards. Using the chi 
square method of spanning the significance of inter- 
group differences, responses to the questions were 
compared to determine whether significant differ- 
ences existed between frequent viewers and non- 


viewers of KUHT-TV. 


RESULTS 


The first problem considered in the present 
study was the relationship of a group of 
demographic variables to ETV viewing. No 
significant differences were indicated with re- 
spect to occupation, income, sex, and race. 
The age category, with a chi square of 7.69 
and two degrees of freedom, yielded a con- 
fidence level of .05, indicating that frequent 
viewers tend to be significantly older than 
nonviewers. More specifically, based on three 
age group categories of 0-29 years, 30-50 
years, and 50 plus years, the analysis sug- 
gested that educational television viewing in- 
creases in the 30-50 age group and reaches its 
peak in the 50 plus age group. 

The second problem considered was the re- 
lationship between viewing frequency and a 
group of personality variables, namely, Au- 
thoritarianism, Dogmatism, and Need for 
Achievement. Analysis of the data did not 
differentiate between the two groups with re- 
spect to any of these variables. 

The third problem, concerning behavioral 
variables, yielded the most interesting results. 
Statistically significant differences were found 
between frequent viewers and nonviewers with 
respect to voting behavior, leisure time ac- 
tivity preferences, and discussion of ETV 
programs. 
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With regard to voting, 75% of the sample 
of frequent KUHT-TV viewers reported that 
they voted in every election, or nearly every 
election, whereas only 40% of the sample of 
nonviewers stated that they voted with such 
regularity. The chi square difference between 
the two groups of 14.35 with two degrees of 
freedom proved to be significant at the .001 
level of confidence, showing a highly signifi- 
cant tendency for frequent viewers to vote 
more frequently than nonviewers. 

Another interesting clue to behavioral dif- 
ferences between the two groups was found in 
answers to the question, ““How would you best 
like to spend a leisure evening?’’ Three cate- 
gories of answers were coded: (a) passive 
noninvolving activity, which included such 
spectator recreations as watching television, 
going to movies, and reading fiction; (4) pas- 
sive involving activity, which included such 
information-value recreations as attending a 
lecture and reading nonfiction; (c) partici- 
pative activity, which included such social 
recredtions as going to a party and playing 
cards. With respect to this group of variables, 
the chi square difference between frequent 
viewers and nonviewers was significant at the 
.05 level of confidence, thus conforming to 
our criterion of statistical significance. The 
frequent viewer of educational television pre- 
ferred information-value recreations such as 
attending lectures and reading nonfiction, 
whereas the infrequent viewer preferred social 
recreations such as going to a party and play- 
ing cards. 

Portions of the present questionnaire were 
designed to help estimate the extent and im- 
pact of program discussions by viewers of 
educational television. The respondent was 
asked if his friends watched KUHT-TV, if he 
discussed the programs on KUHT-TV or top- 
ics related to these programs, and if such dis- 
cussions had been of value. Significantly more 
respondents who were themselves frequent 
viewers of KUHT-TV had friends who 
watched KUHT-TV and who mentioned pro- 
grams seen, or discussed topics related to 
these programs. The differences between fre- 
quent viewers and nonviewers on these ques- 
tions were significant at the .01, .001, and .01 
levels of confidence, respectively. 

These findings suggest that the frequent 
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viewer may discuss educational television with 
other viewers more often than he promotes it 
to nonviewers. This is not to say, however, 
that there is no conversational contact be- 
tween frequent viewers and nonviewers. It is 
to be noted that more than a third of the non- 
viewers did have friends who watched and 
discussed KUHT-TV. It may be, moreover, 
that those who do not watch educational tele- 
vision regularly do not know whether or not 
their friends watch it, and do not recognize, 
as such, topics that are related to ETV pro- 
grams. In support of this hypothesis, inspec- 
tion of the responses does show that nonview- 
ers used the “don’t know” response more fre- 
quently than frequent viewers in answer to 
the question, “Do any of your close friends 
watch KUHT-TV?” 

Other questions directed toward evaluating 
the impact of educational television were con- 
cerned with whether or not the respondent 
felt that the programs he had watched or dis- 
cussed had helped him in any way. One ques- 
tion referred to programs actually watched 
during the week of the interview, and a second 
referred to programs or related topics brought 
up in conversation with friends. Interestingly 
enough, among frequent viewers, almost as 
many affirmative responses were given to the 
latter as to the former question. A reason for 
this could be that the respondent generalized 
his answers to cover both questions. This 
would be the case, especially, with those view- 
ers who had not watched many programs that 
particular week. But the principal conclusion 
to be drawn is that discussion among viewers 
is a significant factor in educational televi- 
sion’s total impact on the community and 
should be considered in evaluation. 

Nonviewers generally were less inclined than 
regular viewers to feel that they had benefited 
from discussions of ETV programs or topics. 
This difference, with a chi square of 3.52 and 
one degree of freedom, approaches significance 
at the .07 level of confidence. This may be 
partly explained, however, by the fact that 
nonviewers have not engaged in as much dis- 
cussion of educational television programs as 
frequent viewers. Among nonviewers who did 
report that friends had mentioned some aspect 
of educational television, 60% felt that this 
information had been of some value. In the 


case of frequent viewers, 85% felt that in- 
formation gained from ETV programs had 
helped them in some way. 

Analysis of the responses concerning pos- 
sible personal benefits derived from actually 
watching or merely discussing KUHT-TV 
programs showed little difference between the 
impact of these two types of exposure to ETV 
program content. One reason for this was 
the tendency for respondents, especially fre- 
quent viewers, to discuss the informative 
value of particular programs they had actu- 
ally watched, even in answer to the question 
which sought to determine whether or not the 
respondent had been helped by discussions of 
educational television topics in general. 

Since an important part of the total im- 
pact of educational television programs seems 
to involve post-program discussion, an analy- 
sis was made to determine how the various 
programs compared with respect to the ex- 
tent that they were discussed by viewers. The 
regularly televised bimonthly meeting of the 
Houston School Board appeared to be the 
most discussed program. More than a fourth 
of those people who had heard discussions of 
KUHT-TV mentioned this program by name. 
Among frequent viewers of KUHT-TV, 40% 
stated that they had either watched the school 
board program the week of the interview or 
had discussed the program at some time. 

The results presented here indicate that fur- 
ther study of ETV audiences with respect to 
behavioral characteristics would be of value. 
The possibility that frequent viewers of edu- 
cational television may possess some distin- 
guishing personality characteristics should 
also be explored further, despite the negative 
results to date. The fact that the frequent 
viewer does appear to differ significantly from 
the nonviewer with respect to behavioral char- 
acteristics strongly suggests some underlying 
personality differences, which more sensitive 
instruments might reveal. 

Further research on the total impact in 
depth and breadth of educational television is 
also indicated. The development of adequate 
measures of impact is in itself an important 
problem. It may be that a viewer-panel type 
of instrument, such as that used in the previ- 
ously cited study by the writer (1957), would 
be more useful for estimating impact than a 
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unilateral questionnaire of the type used in 
the present study. 

In conclusion, it should be pointed out that 
the results of the present study are encourag- 
ing for the educational television movement. 
Our findings indicate that frequent viewers 
discuss and act upon ideas and information 
gained from educational television programs. 
Such viewers could be instrumental in con- 
tributing to the success of this social move- 
ment. 
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All managers in industrial concerns, with 
the exception perhaps of the very highest 
echelon, must integrate their roles as sub- 
ordinates and superiors. The literature, in 
general, implies that the integration process 
is heavily weighted in terms of satisfying the 
demands of the immediate superior rather 
than the immediate subordinates (Dubin, 
1951; Henry, 1948; Herzberg, Mausner, 
Peterson, & Capwell, 1957; McGregor, 1944). 
The reasons proposed for such upward ori- 
entation revolve either around a high upward 
mobility drive (Henry, 1948) or the auto- 
cratic nature of the business organization 
(Dubin, 1951; McGregor, 1944). 

Insofar as supervision, i.e., the implementa- 
tion of decision through the direction of ac- 
tivity of subordinates, is one of the primary 
functions of management, it is difficult to 
visualize effective supervision being based 
upon a relative rejection of subordinate de- 
mands. Although the manager may be under 
the direct sanctioning power of his immediate 
superior, the effectiveness with which he car- 
ries out his duties and upon which he is evalu- 
ated largely is dependent upon the effec- 
tiveness of his subordinates. The subordinate, 
then, has sanctioning power as well. Conflict 
theoretically, however, need not occur as long 
as the managerial subordinate is a manager 
possessing the upward orientation that char- 
acterizes hig superior. 

Unfortunately, in the present state of knowl- 
edge about the dynamic factors characterizing 
a management hierarchy, one can do little 
more than speculate. This research is an at- 
tempt to study the interactions of one level 
of management (division managers) with its 
superior (the plant manager) and its subordi- 
nates (section managers). 


Subjects 


This study was conducted in a moderately 
sized industrial plant located in a small mid- 
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western urban center. The subjects included 
the plant manager, seven division managers 
directly under his supervision, and 37 section 
managers responsible to members of top man- 
agement. All subjects had a supervisory func- 
tion as part of their role obligation. As far as 
could be determined through interviews with 
the participants, the plant manager was pri- 
marily an integrator and policy maker. Divi- 
sion and section management had policy mak- 
ing functions, but within a relatively narrow 
sphere. Of primary importance to them were 
their responsibilities in implementing the de- 
cisions of their respective superiors. 


METHOD 


The instrument utilized in this study was a multi- 
ple ranking device. The stimuli to be ranked were 
made up of 16 occupational role prescriptions (listed 
in first column of Table 2 below), primarily with 
reference to the human relations aspects of super- 
vision, garnered from the literature and from an ear- 
lier study (Rosen & Rosen, 1957). These stimuli were 
ranked by the subjects in order of their importance 
under the following restrictive conditions: 

1. “Qualities men in your position should have,” 
i.e., personal role evaluations 

2. “Qualities you would most like to see in your: 
(a) immediate superior(s) and (b) immediate sub- 
ordinate(s),” demands upon others 

3. “Qualities you think your immediate (a) su- 
perior(s) or (b) subordinate(s) would feel are most 
desirable for men in your position,” ie., prediction 
of demands of others 

Division management ranked the stimuli under all 
conditions. Their superior, the plant manager, com- 
pleted the personal role evaluations, demands upon 
subordinates, and prediction of subordinate demands. 
The middle management group completed the per- 
sonal role evaluations, demands upon superior, and 
prediction of superior demands. 

Rank data were combined for each managerial level 
into an average rank for each of the role variables.? 
Interrelationships between variables were then ana- 
lyzed by means of Spearman rho. The resultant cor- 
relations are presented in Table 1. 


1 With the management classes, the average rank 
scores were characterized by small standard devia- 
tions; consequently, the classes were treated as units 
due to their response homogeneity. 
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TABLE 1 


SPEARMAN RHO CORRELATIONS BETWEEN ROLE VARIABLES 
FOR THREE MANAGERIAL LEVELS 


Prediction of Demands vs. 
Personal Role Evaluations 
Division Management 
Prediction of Superior vs. Personal Role Evaluations 
Prediction of Subordinate vs. Personal Role Evaluations 
Plant Manager 
Prediction of Subordinate vs. Personal Role Evaluations 
Section Management 
Prediction of Superior vs. Personal Role Evaluations 


Accuracy of Predictions 
Division Management 
Prediction of Superior vs. Superior Demand 
Prediction of Subordinate vs. Subordinate Demand 
Plant Manager 
Prediction of Subordinate vs. Subordinate Demand 
Section Management 
Prediction of Superior vs. Superior Demand 


Adequacy of Personal Role Evaluations 
Division Management 
Personal Role Evaluations vs. Superior Demand 
Personal Role Evaluations vs. Subordirflate Demand 
Plant Manager 
Personal Role Evaluations vs. Subordinate Demand 
Section Management 
Personal Role Evaluations vs. Superior Demand 


Personal Role Evaluation and Demands 
upon Superior(s) and Subordinates 
Division Management 
Personal Role Evaluations vs. Demand upon Superior 
Personal Role Evaluations vs. Demand upon Subordinate 
Plant Manager 
Personal Role Evaluations vs. Demand upon Subordinate 
Section Management 
Personal Role Evaluations vs. Demand upon Superior 


Relationships among Role Demands 
of Three Managerial Levels 

Division vs. Section Management 
Demands upon Superior 

Division Management vs. Plant Manager 
Demands upon Subordinate 


Relationships among Personal Role 
Evaluations of Three Levels 

Division Management vs. Plant Manager 

Division Management vs. Section Management 

Section Management vs. Plant Manager 


Note.—Significance of Spearman r} tsing one-tailed test (Siegel, 1956 
* At .05 level: p =.425 
*™ At .O1 level: p =.601. 
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RESULTS AND DISCUSSION 


Prediction of Demands and Personal Role 
Evaluators 


From Section I, Table 1, it is clear that per- 
sonal role evaluations of division management 
were significantly related to their prediction 
of both superior and subordinate demands. It 
should be noted, however, that their personal 
role evaluations were more highly reiated to 
their predictions of superior than subordinate 
demands. Corollary information supported 
these trends. The plant manager’s personal 
role evaluation was moderately and _ signifi- 
cantly related to his predictions of subordi- 
nate demands. Section management’s personal 
role evaluations were significantly related to 
their prediction of superior demands. 

In general, these results suggest that there 
is a tendency to perceive that personal role 
evaluations are highly comparable to superior 
demands, whether or not this position is valid. 
This may be taken as partial evidence for the 
upward orientation of management personnel 
proposed in the literature. Of equa! impor- 
tance, however, are the results that suggest 
that an upward orientation does not preclude 
some recognition of subordinate demands, 
i.e., upward orientation is a matter of degree 
and does not imply a rejection of subordinates 
by management personnel. 


Accuracy of Predictions: Prediction of De- 
mands versus Actual Demands 


Section II, Table 1, indicates that division 
management’s predictions of their superior’s 
demands were not significantly related to the 
actual demands. Predictions with regard to 
subordinate demands, however, related to the 
actual demands to a moderate, but significant 
degree. Corollary data do not support these 
trends. The section management’s prediction 
of superior demands were highly related to 
the actual demands, whereas the plant man- 
ager’s prediction of subordinate demands were 
not significantly related to the actual demands. 

These results suggest that some blockage to 
effective communication may have been pres- 
ent in the interaction between the plant man- 
ager and his immediate subordinates, the divi- 
sional managers, for neither level could pre- 


dict the demands of the other to a significant 
degree. 


Adequacy of Personal Role Evaluations 


Section III, Table 1, indicates that the per- 
sonal role evaluations of the divisional man- 
agement level were not significantly related to 
the actual demands of their superior, but were 
highly related to the actual demands of their 
subordinates. Again, corollary data do not 
support these trends. Section management’s 
personal role evaluations were highly com- 
parable to the actual demands of their su- 
periors, whereas the plant manager’s personal 
role evaluations were not significantly related 
to the actual demands of their subordinates. 
Stated somewhat differently, division and sec- 
tion management levels’ personal role evalua- 
tions were highly related to each other’s de- 
mands, whereas those of the plant manager 
and division management were not. 

This suggests, in the light of the earlier 
data, that fulfilling the role demands of those 
in interlocking managerial levels may be 
largely a function of the accuracy of pre- 
dictions rather than the desire to meet the 
demands of one level more than those of an- 
other. Moreover, it tends to confirm the hy- 
pothesis that there may be barriers to effec- 
tive interaction between the plant manager 
and division management. 


Personal Role Evaluation and Demands upon 
Superior(s) and Subordin.tes 


It is clear from Section IV, Table 1, that 
the demands management makes upon their 
superior({s) and subordinates is largely a 
function of how they evaluate their personal 
roles. This suggests that managers do not con- 
sider that supervisory style is a function of 
the position of the manager within the hier- 
archy, but rather that “managing” calls for a 
common pattern of attributes. 


Relationships among Role Demands of the 
Three Managerial Levels 


Section V, Table 1, indicates that division 
and section. management have much in com- 
mon re the demands they make upon their 
immediate superior(s), whereas the plant 
manager’s and the division management’s de- 
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RELATIVE IMPORTANCE OF SIXTEEN MANAGERIAL 
RoLeE EVALUATION FOR GENERAL, 


Managerial Role Prescription 


Knows job thoroughly 
Plans in terms of the future 

. Takes pains to make himself understood 
Gives credit where credit is due 
Judges subordinates in terms of merit only 
Maintains discipline fairly 


Liberal but consistent in enforcing company 


Takes personal interest in subordinates 
Takes responsibility upon himself 

Fights for what he believes ir 

Cooperative regarding suggestions made to 
Keeps promises to subordinates to the best 
of his ability 

Keeps subordinates informed of what is gol 
Explains the reasons 
Investigates each pro 
making a decision 


Leads rather than drives his subordinates 


mands upon those in immediate subordinate 
positions are not significantly related. Given 
adequate communication of demands, this 
would suggest that within this organization a 
common supervisory style would evolve from 
the division management level down, but that 
the advancing manager having reached the 
level of divisional management would be at 
odds with his superior. 


Relationships among Personal Role Evalua- 


tions of the Three Levels 


Section VI, Table 1, indicates that the three 
managerial levels studied tended to evaluate 
their respective occupational roles in a signifi- 
cantly comparable manner. It should be noted, 
however, that there was a considerably higher 
degree of commonality between section and 
division managements’ personal role evalua- 
tions than between either level and the plant 
manager. 

To the extent that personal role evaluations 
provided the bases for both prediction of de- 
mands and demands made upon 
those in other levels, the relatively greater 
comparability between section and division 


of others 


managements than between division manage- 


lop, 


E 2 
ROLE PRESCRIPTIONS IN TERMS OF PERSONAL 
AND MANAGEMENT GROUPS 


Top Mgt Gen. Mgt Mid. Met 


him 


ng on 
vehind each of his orders 


thoroughly before 


ment and the plant manager are understand- 
able. Given a poor communication of role de- 
mands between management levels, the effect 
would be more pronounced between the divi- 
sion management and the plant manager with 
relatively unrelated personal role evaluations 
than between section and division manage- 
ments having relatively highly related per- 
sonal role evaluations. 

In part, the discrepancies between the plant 
manager and the managers in the two lower 
echelons with regard to personal role evalua- 
tions may be due to differential role demands 
associated with their positions in the hier- 
archy. Inspecting the raw ranks in Table 2, 
one will note that 
counted for most of the discrepancy. The 
plant manager put considerably less stress on 
“job knowledge” (see Item 1) and consider- 
ably more stress on ‘taking personal interest 
in subordinates” (see Item 8) and “explain- 
ing reasons behind orders given to subordi- 
nates” (see Item 14). It is possible that the 
plant manager in the role of generalist and 
integrator may have been somewhat deprecia- 


three prescriptions ac- 


tory of such a simple-minded prescription as 
“job knowledge” when, in effect, his major 
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problem is that of utilizing information of 
others and arriving at solutions to problems. 
With diversified issues to be settled, the only 
commonality is decision making or problem 
solving—difficult to classify as “job knowl- 
edge.” Lower echelon management, having 
less of an integrative function and more of a 
technical function as a consequence may have 
assessed “job knowledge” as being relatively 
more important. 

The two other variables that the plant man- 
ager upgraded in importance relative to his 
subordinates are almost textbook examples of 
“good human relations” approaches. In this 
case, the plant manager seemed to be more 
in the avant-garde of the human relations 
movement than the lower echelons of man- 
agement. The plant manager having reached 
the peak organizationally, and in Katona’s 
(1951) terms; perhaps having identified him- 
self with the organization to a greater degree 
than did his subordinates, may have put more 
weight on variables leading to organizational 
welfare. 


SUMMARY 


A study was conducted with three levels of 
management in a moderately sized industrial 
plant to provide data relevant to the dynamic 
aspects of managerial role interaction. The 
data suggest that the manager has an upward 
orientation toward the demands of his su- 
perior, although in so doing, he does not seem 
to reject the demands of his subordinates. 
The effectiveness of such an orientation upon 
developing a common value system re super- 
visory style, is highly dependent upon the ac- 


curacy of prediction of demands. In cases 
where predictions paralleled the actual de- 
mands, the impact was apparent, where pre- 
dictions were erroneous, the impact upon per- 
sonal evaluations was negligible. The tend- 
ency to expect of those in other levels what 
one demands of self again tends to provide a 
necessary but not sufficient condition for de- 
veloping a common set of values re super- 
vision among various levels of management. 
Finally, it was suggested that in the absence 
of adequate role demand communications, 
differential demands due to position in the 
management hierarchy may have in part ac- 
counted for the relatively lower commonalities 
in supervisory values between the plant man- 
ager and those in subordinates positions. 
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ATTITUDES TOWARDS NEWSCASTERS AS MEASURED 
BY THE SEMANTIC DIFFERENTIAL: 
A DESCRIPTIVE CASE 


PAUL M. KJELDERGAARD ! 


Harvard University 


The semantic differential is a combination 
of word association and scaling techniques 
developed by Charles Osgood of the Univer- 
sity of Illinois in conjunction with his work 
on verbal behavior (Osgood, 1952). Concepts 
are rated on seven-point scales bounded by 
polar adjectives, e.g., Good-Bad, Kind-Cruel, 
etc. Adverb modifiers qualify each step on the 
scale; the greater the intensity of the asso- 
ciation, the more extreme the displacement 
towards one or the other polar term. Each 
concept is rated on several scales so that the 
semantic profiles of the concept can be 
plotted. By computing Ds (the generalized 
distance function in m-dimensional space) 
the geometrical relationships between con- 
cepts or “meaning” may be found (Osgood & 
Suci, 1952). 

In the first use of the technique in its 
present form, Stagner and Osgood (1946) 
measured the changes in attitude toward cer- 
tain stereotypes as this country became in- 
volved in World War IT. Successive groups of 
subjects rated such concepts as Pacifist, Rus- 
sian, and Dictator at various times between 
April 1940 and March 1942. The instrument 
proved sensitive to the predicted changes in 
the “meaning” of the social stereotypes. In 
addition it was found that certain polar 
Fair-Unfair, Kind-Cruel, High- 
Low correlated very highly with each other, 
yet were relatively independent of other 
e.g., Strong-Weak, Happy-Sad. This 
latter finding led to the use of factor analysis 
in conjunction with the semantic differential. 

The potentiality of the instrument in opin- 
ion research was shown in a study by Osgood, 
Suci, and Tannenbaum (1957) in which the 
semantic differential was used in conjunction 
with a public opinion poll conducted in the 


scales, e.g., 


scales, 


‘ The author wishes to express his appreciation to 
James J. Jenkins for his assistance in the preparation 
of this manuscript. 


Champaign-Urbana, Illinois area. A sample of 
150 subjects representative of the voting 
public of this area was tested four times dur- 
ing the summer and fall of 1952. These sub- 
jects rated 20 concepts, 10 of persons, e.g., 
Truman, Taft, and Stalin, and 10 of policies, 
e.g., use of the atomic bomb, our foreign 
policy, the New Deal. By matching the differ- 
ential profiles of the “undecided” voters with 
profiles of Stevenson or Eisenhower support- 
ers, the experimenters were able to predict 
how 18 out of 19 “undecided” actually voted 
at the polls. 

Applications have also been made in the 
field of advertising research where studies 
have been conducted on the effect of color on 
advertised products, the effect of titles on the 
significance of pictures, and the influence of 
slogans on their advertised products (Osgood 
et al., 1957). Mindak (1956) used the differ- 
ential to compare the effectiveness of certain 
advertising appeals, e.g., the scientific appeal, 
the romantic appeal, etc., used in radio com- 
mercials for a new cosmetic product. 

This brief review only suggests the versatil- 
ity and the diversity of application of the 
instrument. Many studies of theoretical im- 
portance to psychology in such fields as verbal 
behavior, personality, clinical, and the dy- 
namics of social attitudes (Jenkins, Russell, 
& Suci, 1958: Osgood & Luria, 1954: Osgood 
et al., 1957) have also employed the semantic 
differential. Other areas of application men- 
tioned by Osgood et al. (1957) are studies of 
symbolism, aesthetic judgment, and interpre- 
tation of sonar signals. 

This article deals with a novel application 
of the instrument as a selection and evalua- 
tion device in casting a new television news 
program. Of equal importance is the fact that 
this study provides a measure of the stability 
of attitude changes which were induced by 
the viewing of a new program. Thus, the re- 
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search provided additional evidence concern- 
ing the reliability and validity of the semantic 
differential in applied research. 


PROBLEM 


Station Y-TV planned to reschedule its 
late evening news broadcast in an attempt to 
win back some of the news audience it had 
lost to station Z-TV. This was to be a com- 
pletely new program with new personalities, a 
new stage setting, and the latest television 
electronic “gimmicks.” In order to get an 
audience reaction, an outside research firm 
was called upon to pretest the new program 
with an audience via a closed-circuit TV in 
the station’s studios. 

The station was interested in getting the 
audience’s reaction to the setting. the format. 
and the TV personalities employed. Special 
interest was placed on the last of these con- 
siderations, for though the chief newscaster 
to be employed was not new to television, nor 
this area, his role had been primarily one of 
master of ceremonies for various local pro- 
grams. There was some question as to whether 
he would project the maturity and knowledge 
expected of someone telecasting news. 

One other individual was to supplement 
the newscaster on the new program, an ed- 


itorialist and announcer who had been the 
newscaster at the old evening time. 
METHOD 
Two techniques were employed in gaining the 
necessary information about the audience’s reaction 


to the program and the individuals who participated 


First a minute group interview was conducted 
following the viewing of the program. Every aspect 
of the program was covered including reactions to 


the cast 

\ second technique, the semantic differential, was 
two individ- 
the 


employed to get a verbal picture of the 
well as one 

Station Z. The differential 
red both before and after the pri 
later by to measure the 
effect of the program on the verbal stereotypes held 
The I up was to 


15-week 
the stability of the program effect over a longer time 


uals mentioned above a of 


news- 


casters trom competing 


was administe gram 


ilso 15 weeks mail) so a 


follow an attempt measure 
interval 
The study was 


obtained 


planned so that s would be 
on three individuals who appeared on the 
program and on one person (X) who did not appear 
on the program. This latter individual was the most 
popular and most “listened-to” 
in the 


rating 


television newscaster 


area and in a loose sense could be considered 
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a criterion or target against which to compare other 
newsmen. Furthermore, if we assume the ratings of 
individuals are independent, then from a theoretical 
standpoint there should be no change from the pre- 
show to the postshow ratings of Individual X. Thus, 
the ratings on this newscaster provide us with an 
estimate of the reliability of the instrument. 

The 20 subjects which were called in to view th« 
new program were drawn from the consumers panel 
of a local market research organization. The group 
tended to be nonrepresentative of the population of 
this area in being more highly educated and in a 
higher income bracket than a typical listening audi- 
ence would By and large the sample was made 
up of couples rather than randomly selected indi- 
viduals. The group was given a token reward for 
having participated in the preview. 


be 


RESULTS AND DISCUSSION 


Figure 1 represents the semantic profiles 
for the three individuals (X-——the newscaster 
on .competing Station Z, A—the new news- 
caster for Station Y, and B—the editorialist 
and former newscaster for Station Y) over 
the three rating periods.* Side-to-side com- 
parisons indicate the relative merits of these 
individuals as viewed by the audience at each 
time period. Up and down comparisons show 
the changes that take place in an individual 
profile over time. 

The preshow ratings from Figure 1 demon- 
strate a number of things. First, it is clearly 
indicated that Newscaster X enjoyed tremen- 
dous popularity and is held in high esteem by 
this group. His mean ratings on 8 of the 12 
traits are 6.25 or higher, and on 3 of the 4 re- 
maining scales, extremely high ratings could 
be interpreted as probably being undesirable 
Individuals A—Station Y’s 
B—the editorialist have mean semantic pro- 
files on the preshow ratings which displace 


newscaster and 


only slightly away from the neutral position 
on most scales. There is little difference be- 
tween their profiles except that A is viewed as 


much more Warm, Pleasant. and Interesting 


than B. 

A three page table giving the means and standar 
deviations on which Figure 1 is based has be de 
posited with the American Documentation Ir ute 
Order Document No. 6550 from ADI Auxiliarv Pub 
lications Project, Photoduplication Service, Librarn 
of Congress; Washington 25, D. C., remitting in 
vance $1.25 for microfilm or $1.25 for photocopies 
Make checks payable to: Chie Photoduplication 
Servi Librarv of Congre 
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Attitudes towards Newscasters 


Pre Show 
$ 


4 

Active 
Composed 
Warm 
Aggressive 
Fast 
Friendly 
Important 
Sincere 
Pleasant 
Interesting 
Mature 


Informed 


Active 
Composed 
Waren 
Aggressive 
Fast 
Friendly 
Important 
Sincere 
Pleasant 
Interesting 
Mature 


Informed 


Active 
mposed 
Aqaressive 
Fast 
Friendly 
important 
Sincere 
Pleasant 
Interesting 
Mature 


Informed 


Individual X 


1. Mean semantic differential ratings on 


periods: before an experim 

Turning to the postshow ratings, one finds 
that marked changes have taken place as a 
result of viewing the program. Only for In- 
dividual X have the verbal stereotypes re- 
mained relatively unchanged; he suffers a 
slight regression towards the neutral position 
on traits (X = .12). 


most The close corre- 


Individual A 


three 


Ratings 
3 


Passive 
Nervous 
Cold 
Withdrawn 
Slow 
Unfriendly 
Unimportant 
Insincere 
Unpleasant 
Uninteresting 
Immature 


Uninformed 


Withdrawn 
Slow 
Untriendly 
Unimportant 
Insincere 
Unpleasant 
Uninteresting 
Immature 


Unintormed 


Insincere 
Unpleasant 
Uninteresting 
Immature 


Uninformed 


B 


Individual 


television personalitic time 


Loree 


after the program, and 15 wecks later 


spondence between his two profiles reflects 


the high reliability of the instrument—a find- 
ing consistent with those of other investigators 
(Osgood, 1952; Jenkins et al., 1958) 
Individual A’s mean rating increased o1 
each of the 12 traits. Although he still failed 
to exceed Individual Z on trait, the dif- 
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ferences between the ratings of the two in- 
dividuals on all traits were reduced. The 
largest remaining difference (Maturity) had 
been reduced from 1.81 to .60 as a result of 
the program. His profile mirrored that of X, 
an experienced newscaster, after only one 
news program. 

The verbal stereotypes associated with In- 
dividual B made changes in both directions, 
thus dispelling the notion that there was a 
general increase in mean ratings due to the 
recency of having seen the program. Further- 
more, the magnitudes of the negative changes 
for B (the editorialist) were relatively great, 
whereas the changes in a positive direction 
were small and insignificant. 

The follow-up ratings were obtained after 
a 15-week interval had elapsed. During 12 of 
these weeks Individual A did the newscasting 
for the new program; and Individual B no 
longer appeared on this program, although he 
did participate in other programs for Station 
Y. Newscaster X was on his regular program 
on Station Z throughout the 15-week period. 

A supplementary questionnaire showed that 
this group watched news on the new program 


about twice as frequently as they had on the 
old Y-TV show. How much of this increase 
was due to ego-involvement with this new 
program as opposed to other possible vari- 
ables such as a more favorable time, a better 


liked cast, etc., cannot be determined. It 
should be noted that none of the respondents 
reported watching the new program exclu- 
sively; the median frequency for watching 
Individual X on Z-TV news was two-and-one- 
half times per week. 

Newsman X enjoyed a general increase in 
follow-up mean ratings over his postshow 
ratings, gaining on all but one trait—Matu- 
rity. The mean increases were small, however, 
and the general pattern was the same as the 
two previous ratings, indicating that this 
group had developed a stable picture of X 
before the preview and his ratings were in- 
fluenced very little even after changes in the 
viewing habits of the subjects. 

On the other hand, Broadcaster B moved 
further away from the desirable end of the 
continuum; his only remaining overlap with 
the other two persons being on Traits Fast 
and Aggressive. Several hypotheses suggest 


themselves as possible explanations of this 
phenomenon, the most credible of which is 
that the group was reinforced for their nega- 
tive attitude toward B—he no longer ap- 
peared on the program—thus intensifying 
their negative feelings. Many variables were 
operative over the elapsed time, however, and 
any explanation must remain conjecture. 

The follow-up ratings of Individual A 
showed slight mean increases on 8 of the 12 
traits over his postshow ratings. Whereas on 
the earlier ratings, A had mean ratings con- 
sistently below X, he now exceeded X on one 
trait—Informed—and tied him on_ several 
others. His profile was now generally closer to 
X’s with the gap being wider on only two 
scales—Fast and Aggressive. 

Unfortunately, it is impossible to perform 
tests of significance on the changes which 
took place, for all measures are intercorre- 
lated. It is possible to express numerically the 
differences between any two profiles, although 
no satisfactory statistical tests have been de- 
veloped to test these differences. The measure 
referred to is D (the generalized distance 
function) which is defined as D = \/Xdj; 
where dj; is the differences in means for the i™ 
polar pair and the j™ individual rated. This 
measure takes into account the differences be- 
tween profiles as well as the correlation be- 
tween them. 

Table 1 presents 27 of the possible 36 pro- 
file comparisons. Ds for individuals in one 
time period compared to other individuals in 
other time periods were omitted because they 
lacked meaningfulness. 

Although there are no statistical tests to 
evaluate the magnitude of Ds, a crude esti- 
mate of the amount of random errors asso- 
ciated with this technique in this situation 
can be obtained from the D value for Indi- 
vidual X for the comparison of preghow and 
postshow profiles. If the instrument had per- 
fect reliability and judgments were independ- 
ent, i.e., changes in ratings of one individual 
did not effect the ratings of other individuals, 
the D value for this comparison would be 
zero. The value obtained .65, the square root 
of the squared differences over 12 pairs, can 
be interpreted as being very small; therefore, 
it appears that the instrument has relatively 
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Attitudes towards Newscasters 


TABLE 1 


PROFILE Ds Computep for THE THREE INDIVIDUALS oN oF Turee Time Pertops 


Pretest Posttest 


Follow-up 


A 


Pretest 
0.00 


0.00 
Posttest 2. 1.21 


5.63 


1.25 


0.00 


I ollow up 


high reliability and the ratings appear to be 
relatively independent. 

Using this value as a refererfce point, it is 
apparent that the majority of the Ds are of a 
sufficient magnitude so that the observed 
differences are not likely to be chance varia- 
tions. Possible exceptions would be the differ- 
ences in profiles of X’s pretest, posttest, and 
follow-up ratings; and the profile changes for 
A from posttest to follow-up. All other Ds 
are at least twice the magnitude of the 
“error” term. Certainly neither the successive 
changes in B’s profiles nor the change in A’s 
profile as a result of the experimental program 
seems likely to be anything but real changes 
in attitude. 


Validity of the Technique 


This technique has certain obvious face 
validity. As noted before, Individual X was 
a highly successful television news broad- 
caster, enjoying the highest audience ratings 
of any program of its kind in the area. His 
initial ratings and successive semantic profiles 
were extremely high and positive, thus re- 
flecting his esteem. 

In addition to the face validity of the tech- 
nique, the postshow group interview with the 
audience provided some evidence which cor- 
roborated the profile changes. For example, in 
reviewing the tape of the interview, it was 
found that the ratio of negative comments to 
positive comments was nearly 3 to 1 for In- 


0.00 


7.42 


1.89 


7.80 


dividual B. This same ratio for A was 1 to 12. 
Certain individuals in the group stated that 
they would not voluntarily watch B on any 
type of program. Derogatory remarks were 
made about his delivery, voice, and ‘‘personal- 
ity.” With the exception of one individual, no 
one said they enjoyed him on a news program. 

With respect to Person A, direct evidence 
for a positive change in attitude was obtained. 
One woman said that she had seen him on 
several morning programs and just did not 
care for him, but here she liked him. Much 
emphasis was placed on the maturity which 
he now assumed. One man said he remem- 
bered A blowing bubble gum and could not 
forget the incident, but on the news, he 
stated, A seemed mature and informed. An- 
other woman commented that he seemed to 
control his “little-boyish-evebrows.’ 


SUMMARY 


A group of 20 adults, 9 men and 11 women, 
previewed a new television-news program on a 
closed channel hookup in a Y-TV studio. The 
semantic differential was employed to meas- 
ure the change of audience attitude toward 
certain individuals as a result of watching the 
program. Ratings were obtained at three dif- 
ferent times: prior to the program, imme- 
diately following the program, and 15 weeks 
later. The results indicated: 

1. For an individual who did not appear on 
the program, X, but with whom the entire 
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group was familiar, the instrument showed 
highly reliable and consistent results. 

2. This technique proved to be extremely 
sensitive to the changes in verbal stereotypes 
which took place as a result of watching the 
preview. The postprogram ratings accurately 
predicted the follow-up ratings. 

3. There was an indication of both “face 
validity” and corroborating evidence from the 
interview attesting to the validity of the 
technique. 


REFERENCES 
Jenkins, J. J., Russert, W. A., & Sucr, G. J. An 
atlas of semantic profiles for 360 words. Amer. J. 
Psychol., 1958, 71, 688-699. 
Minpak, W. A. A new technique for measuring ad- 


vertising effectiveness. J. 
379. 

Oscoop, C. E. The nature and measurement of 
meaning. Psychol. Bull., 1952, 49, 197-237. 

Oscoop, C. E., & Luria, Zevxa. A blind analysis of a 
case of multiple personality using the semantic dif- 
ferential. J. abnorm. soc. Psychol., 1954, 49, 579- 
591. 

Oscoop, C. E., & Suci, G. J. A measure of relation 
determined by both mean difference and profile 
information. Psychol. Bull., 1952, 49, 251-262. 

Oscoop, C. E., Suci, G. J., & TANNENBAUM, P. H. 
The measurement of meaning. Urbana: Univer. 
Illinois Press, 1957. 

STAGNER, R., & Oscooo, C. E. Impact of war on a 
nationalistic frame of reference: I. Changes in gen- 
eral approval and qualitative patterning of certain 
stereotypes. J. soc. Psychol., 1946, 24, 187-215. 


Market., 1956, 20, 367- 


(Received March 26, 1960) 


\. 
i, 
fe 
1 
| 
ie wat} 
i 
? 
Ee 
4 
i 
4, 
tt 
‘a 
if 
4 


Journal of Applied Psychology 
1961, Vol. 45, No. 1, 41-44 


THE EFFECT OF EXPOSURE TO AN EXTREME 
STIMULUS ON JUDGMENTS OF SOME 
STIMULUS-RELATED WORDS 


BERNARD J. FINE 


Quartermaster Research and Engineering Center, Natick, Massachusetts 


Since the early scaling studies of Volkmann 
(1936) and Sherif (1936), among others, 
many experiments have demonstrated end- 
anchoring effects. The phenomenon generally 
occurs when an extreme stimulus or “anchor” 
is added to a scale as an end-point. Addition 
of the anchor tends to extend the scale so that 
subsequent judgments on the scale are ex- 
pressed in less extreme categories. This has 
been referred to as a “contrast” effect by 
Sherif, Taub, and Hovland (1958). 

The anchoring effect has been shown to 
occur with judgments concerning such diverse 
phenomena as inclinations of lines (Rogers, 
1941), pleasantness of colors (Hunt & Volk- 
mann, (Heintz, 1955), and 
various occupations (Mc- 
1943). Johnson (1945) concluded 
that since the phenomenon has been demon- 
strated in judgments on such a diversity of 
material, it may be taken as a general prin- 
ciple of judgment. 


1937), weights 
the prestige of 
Garvey, 


If the anchoring effect is a phenomenon 
that may be generalized, then the implications 
for the experimenter using judgment scales to 
estimate the impact of certain controlled 
stimuli on humans are quite serious but, 
often, not readily apparent. 

It is obvious to most investigators dealing 
with psychophysical phenomena that anchor- 
ing effects confound their data. For 
example, Campbell, Lewis, and Hunt (1958) 
found that when subjects were asked to judge 
the pitch of notes in terms of a piano key- 


may 


board reference system, differences in judg- 
ment of a common note’s pitch were obtained 
depending upon whether the subject first 
1 These authors also demonstrated that the inter- 
position of anchors can result in judgments shifting 
toward the anchor as well as from it. They 
refer to this as an “assimilation” effect. It occurs 
with the introduction of the anchoring stimulus 
immediately adjacent to the stimuli being judged. 


away 


heard low or high comparison notes in a 
series. In such instances, however, the experi- 
menter is generally aware of this effect and 
through appropriate ordering of the stimuli 
can prevent its occurence. However, there is 
little evidence that the anchoring phenom- 
enon has been taken into account as a vari- 
able in judgment situations other than the 
psychophysical. This is particularly true ing 
studies involving the use of ordinal scales 
before, during, and after exposure of subjects 
to a controlled stimulus in order to evaluate 
the effect of the stimulus on the subjects 
judgments. 

The present study was designed to illus- 
trate the necessity for taking anchoring phe- 
nomena into account in judgmental situations 
involving ordinal scales. 

An experiment, reported elsewhere (Fine & 
Gaydos, 1959), in which subjects were ex- 
posed to ambient temperature conditions, 
which, it could be assumed, made them feel 
colder than they had ever felt before. pro- 
vided the setting for this investigation. It was 


assumed that exposure of the subjects to this 
extreme cold stress would have the effect of 


interposing a new, more extreme anchoring 
stimulus at the “cold” end of an ordinal scale 
on which they were required to rate cold- and 
warm-toned words. It was hypothesized that 
as a consequence of their exposure to the cold, 
“cold” the 
warm-cold scale would shift away from the 
newly imposed, 
trast” effect). 


subjects’ ratings of words on 


extreme end-point (‘con- 


MrtTHOD 
Sub je cts 


enlisted men from Fort Devens, 
Massachusetts. They were brought to the 
master Research and Engineering Center 
groups of 12 to participate in experiments conducted 
20 March, 1958 


Subjects were 14( 
Quarter- 
dailv in 


during the period 3 to 


Ie 

Be 
ey 
i 

9. 
| 

te, 

ity 

7 

| 

11 


4? Bernard J. Fine 


Procedure 

Each day, the 12 subjects were randomly divided 
into three groups. Six subjects (Group 1) were as- 
signed to an experiment in which they were exposed 
to an environment of 50°F., 50% relative humidity, 
and 5-mph wind speed for 75 minutes, clad only in 
shorts and lying virtually motionless. Groups 2 and 
3, both three-man groups, were assigned to a second 
experiment. Group 2 was exposed to an environment 
of 50°F., 50% relative humidity with no wind for 
1 hour. This group worked on a psychological task 
during the exposure period and was stripped to the 
waist. Group 3 served as the control group tor 
Group 2. Subjects in Group 3 performed the same 
task as those in Group 2 but were fully clothed and 
in a warm, comfortable environment. By nature of 
the combinations of ambient conditions and clothing 
worn, Group 1 could be described as being exposed 
to a rather extreme cold (for subjects not 
accustomed to such exposure), Group 2 could be 
characterized as being exposed to a moderate cold 
stress, and Group 3 to no cold stress. 

Prior to taking part in these experiments, all sub- 
jects completed a scaling task (Session 1) under 
warm, comfortable conditions. The task can best be 
described by quoting the instructions accompany- 
ing it. 


stress 


Below you will notice something that looks like 
a thermometer. Let us suppose that this thermom- 
eter shows how hot or cold people feel instead of 
how hot or cold the temperature is 

You will notice that there are numbers from / 
to 16 running up from the bottom to the top of 
the thermometer. One means that it is as cold as 
a person can possibly feel. Sixteen means that it is 
as hot as a person can possibly feel 

Now, we all use many different words to tell 
other people how hot or cold we feel. These words 
that we use sometimes mean different things to 
different people. We would like to know what they 
mean to you. 

We have listed 14 of these words alongside of 
the thermometer. We would like you to write next 
to each word the number on the thermometer 
which shows what the word means to you in terms 
of feeling hot or cold. 

For example, take the first word, “irozen.”’ Sup- 
pose you tell your buddy that you feel “frozen.” 
You would probably mean that you feel as cold as 
you could possibly feel. So, you would write in / 
beside the word “frozen” because the thermometer 
shows that 7 is as cold as you could possibly feel 

Take the second words, “a little warm.” Now, 
you know that 7 is as cold as you could possibly 
feel and that 16 is as hot as you could possibly 
feel. Therefore, “a littke warm” must fall some 
where between 1 and 16. We would like to know 
where you think it should fall. That is, if you were 
to tell your buddy that you were “a little warm,” 
how hot or could would you mean? Do this for 
all of the words. You may use the same number 

more than one word 


The words and phrases rated by the subjects were 
“a little warm,” “slightly cold,” “extremely warm,” 
“terribly cold,’ “warm,” “extremely cold,” “hot,” 
“a little cool,” “very warm,” “cool,” “very cold,” 
“extremely hot,” and “cold” in that order. The in- 
structions were read aloud to the subjects and care 
was taken to insure that all subjects understood the 
task. After completing the scaling task, the subjects 
participated in the experiments to which they had 
been assigned 

Approximately 6 hours later, 2 hours after the 
subjects had completed their tasks, had eaten a good 
meal, and relaxed for a short while, all subjects again 
completed the identical scaling task (Session 2) 
under the same comfortable environmental condi- 
tions as initially. No explanation for the repetition 
of the task was given other than that the experi- 
menter would like to have the subjects rate the 
words once again. 


RESULTs 


Fifteen subjects turned in ratings that were 
incomplete or which indicated that they had 
not understood the instructions. The data 
from these subjects were eliminated from the 
analysis. After the elimination of these sub- 
jects, the Vs for groups 1, 2, and 3 were 58, 
29, and 38, respectively. 

Since the words scaled by the subjects were 
not a random sample of words describing 
warmth or coldness, it was not feasible to 
determine the variance of the subjects’ rat- 
ings and, therefore, it was not possible to 
compute the significance of the differences in 
shifts in judgments between groups. How- 
ever, it was possible to create a total popula- 
tion of changes in ratings of all words (from 
Session 1 to Session 2) and to compare the 
three groups individually with regard to these 
changes. Accordingly, the before-after changes 
in ratings for the 13 words were treated as a 
given finite population over all subjects. A 
subsample consisting of all of the “cold” 
words was drawn from that population and 
the ¢ test was used to show whether or not 
the sample was a random one. Thus, the mean 
change for the subpopulation of “cold” words 
was compared with the mean change for the 
total population of words for each of the three 
groups. The results of these comparisons are 
shown in Table 1. 


2 The author wishes to acknowledge the assistance 
given him in various phases of the data analysis by 


Agnes M. 
Monroe. 


Galligan, G. P. Wadsworth, and Ella 
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TABLE 1 
COMPARISON OF CHANGE FROM SESSION 1 TO SESSION 
2 oF RaTincs oF Warps AND ALL Worps 
FOR THREE GROUPS 
Mean Changes 
Group 1 Group 2 


NV Change V Change 


“Cold” words 406 +.160 03 —.010 
All words 377 109 
1.73 


+) indicates a shift away from the cold 


) indicates a shift toward the cold end of t 


If there were no systematic effects of ex- 
posure to cold on subjects’ ratings, it would 
be expected that the subpopulation of “cold” 
word changes in ratings would not differ sig- 
nificantly from the total population of changes 
in ratings. It is evident from Table | that this 
is the case for Group 2 and Group 3. How- 
ever, for Group 1, the subpopulation of “cold” 
word changes in ratings does differ signifi- 
cantly from the total population of changes. 
It is further evident that the shift is away 
from the cold end of the scale. 

To show that this statistically significant 
shift in ratings of “cold” words by Group 1 
was not due to extreme shifts of only two or 
three “cold” words, the mean shifts for each 
word by groups were computed. This analysis 
revealed that six out of seven of the “cold” 
words shifted away from the cold end of the 
scale for Group 1, whereas only one “cold” 
word out of seven shifted away from the cold 
end of the scale for either Group 2 or Group 
3. The difference between Group 1 and either 
Group 2 or 3 in the number of “cold” words 
shifting away from the cold end of the scale 
is statistically significant (p< .025 using 
Fisher’s exact probability test). 


DISCUSSION 


The data substantiate the hypothesis that 
exposure to extreme cold would shift subjects’ 
ratings of “cold” words away from the cold 
end of a warm-cold ordinal scale. This is con- 
sistent with the findings in the literature on 
anchoring phenomena. Exposure to moderate 
cold had no significant effect on subjects’ 
ratings. 


While the magnitude of the changes in rat- 
ings is small and probably of little practical 
consequence, it should be noted that this is 
more likely than not a function of the struc- 
ture of the scale and the number of words 
rated. Certainly, rating 13 words on a 16- 
point scale minimized the chance for change 
to occur. Had more scale points or fewer 
words been used, shifts might have been con- 
siderably larger. 

The implications of this study for investi- 
gators in similar situations may be illustrated 
by the following example. Subjects in Group 
1 were required, as part of their performance 
in the main study in which they were partic- 
ipating, to periodically indicate on a five-point 
ordinal scale their feelings of warmth or cold- 
ness during the cold exposure. Since the words 
on the five-point scale also appeared on the 
16-point scale of the present study, it is cer- 
tain that at some time, during the exposure, 
the meaning, to the subjects, of the words on 
the five-point scale, changed as the subjects 
became colder. A subjective cooling curve. 
plotted from this data, would have little 
meaning for it would not reflect a change in 
subjects’ feelings alone, but also a change in 
the meaning, to the subjects, of the words 
used in the rating task. 

One way of preventing such occurrences is 
to provide the subject with a physical con- 
comitant to the verbal anchor points before 
the actual experimentation starts. As in psy- 
chophysical studies where the subject judges 
certain stimuli against a standard stimulus, so 
in situations such as the present one a stand- 
ard should be provided. Thus, in the present 
situation, had the subjects all been exposed 
for 75 minutes to the same cold conditions at 
some time prior to the actual experimental 
session, they would have had a referent for 
the anchor point “as cold as you can possibly 
feel” and their subsequent judgments would 
probably not have been confounded by 
changes in the meanings of words. 


SUMMARY 


The study was designed to test the hy- 
pothesis that subjects’ ratings of cold-toned 
words would shift away from the cold end of 
a warm-cold ordinal scale as a consequence 
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of exposing the subjects to extremely cold 
temperatures. 

Three groups of subjects were exposed re- 
spectively to extremely cold, moderately cold, 
and warm ambient conditions. Before and 
after exposure, all subjects rated 13 words 
describing various degrees of warmth or cold- 
ness on a 16-point warm-cold ordinal scale. 

The results verified the hypothesis; ratings 
of the “cold” words for the group exposed to 
the extreme cold shifted away from the cold 
end of the scale. The results are consistent 
with previous research on end-anchoring 
effects. 

Implications for the researcher using or- 
dinal scales in situations where subjects are 
exposed to extreme stimuli with which they 
have had little experience are discussed. 
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AN EXPERIMENTAL STUDY OF THE EFFECTIVENESS 
OF BRAINSTORMING ' 
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The problem of creativity has become the 
focus of much attention during recent years. 
“Creativity” as used in this paper can be 
briefly defined as “the production of new 
ideas.” Especially the study of the conditions 
under which people are most likely to pro- 
duce creative ideas has aroused considerable 
interest. The policies of educators, of adminis- 
trators in business or industry, and of others 
may be strongly affected by better knowledge 
of such conditions. This may apply to rela- 
tively permanent conditions such as “the emo- 
tional atmosphere” among co-workers, or to 
temporary conditions such as attitudes pro- 
duced by the type of instructions with which 
a problem is presented to the potential crea- 
tor of ideas. 

The present research is concerned with such 
temporary increases in productivity produced 
by the widely-used technique of “brainstorm- 
ing” (Osborn, 1957). When engaging in brain- 
storming a group of people is presented with 
a problem, and with instructions such as the 
following regarding the way in which its solu- 
tion should be sought: 


Group work is good only if everyone lets himself 
co, and thinks and says whatever idea occurs to him, 
matter how silly it may seem the 
most ridiculous idea may be better than some others 
Therefore, try to avoid criticizing your own or some 
one else’s ideas. If anyone is criticizing I will remind 
him Laughing and sneering 
criticism. We want a lot of ideas; the more we have. 
the better there is to find the better ones 
Don’t worry about how good the ideas are: they will 


evaluated later. When you try to create and judge 
+} 


no Sometimes 


to stop. are torms ot 


chance 


at the same time, nothing much is accomplished: you 
get too bogged down in judging, rather than in 
creating. Also, it has been found that good ideas are 


sometimes created by improving on other ideas and 


rhe paper is based on a thesis presented to the 
graduate school of Purdue University, Lafayette, In- 
diana, by the junior author in partial fulfillment of 
the requirements for an MS degree in psychology. 
The study was directed by the senior author 

‘Now at Veterans Administration Hospital, Leb 
inon, Pennsylvania 


by combining two or more ideas into new ones. Are 
there any questions? Let me summarize the way we 
will operate: 1. Criticism is ruled out. 2. The wilder 
the idea the better: let yourself go and say whatever 
comes into your head. 3. Quantity is wanted. 4. Com- 
bination and improvement of ideas is desired. 


These instructions were used in the present 
experiment. In other cases of brainstorming 
the instructions were paraphrased somewhat 
differently but in each case they were elabora- 
tions of the four italicized points which are 
summarized at the end of the previous para 
graph. 

It is the purpose of this study to compare, 
as to quality as well as quantity, the ideas 
produced by instructions suggesting -the sus- 
pension of criticism and the ideas produced 
by instructions suggesting critical censorship. 


METHOD 


Subjects. The subjects were 42 undergraduate col- 
lege students at Purdue University. They were ran- 
domly divided into six groups, each group consisting 
of students, four which males and 
three of which were The were not 
matched for intellectual level because it was assumed 
that random assignment would cancel out intellectual 
differences to a large extent, and that the remaining 
uncanceled differences would be of necligible 
nitude because of the narrow margin in intellectual 
ability among college students 

Task. The task with which the were pre- 
sented was the invention of brand names for three 
kinds of articles designed especially for Purdue stu 


seven ot were 


females groups 


mag- 


subjects 


dents, namely, a cigar, a deodorant, and an auto- 
mobile. There are several reasons why these tasks 
seemed especially adequate for the purpose of thx 


present (a) The creating of brand names and 


similar activities pertaining to the fields of 


tudy 


idvertis 


ing, publicity, or promotion constitutes the problem 
area for which brainstorming was originally devel- 
oped, notwithstanding the fact that its use was later 


(b) The eval- 
uation of the effectiveness of brand names constitutes 
somewhat less of a problem than the evaluation of 
suggested solutions to other problems. Thus, quality 
ratings by judges who are Purdue students seem 
relatively adequate measures of the effectiveness of 
brand names created to impress Purdue students. It 
would more difficult to find 


extended to many other problem areas 


be equally adequate 
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measures of the effectiveness of ideas regarding, for 
example, the solution of technical, social, or personal 
problems. (c) Creating brand names is a relatively 
“simple and, specific” problem that is “not too broad” 
and can be “stated clearly” (Osborn, 1957). Such 
problems have been said to lend themselves especially 
well to brainstorming procedures. 

Procedure. The junior author met with the six 
groups of subjects for one session each. With three of 
the six groups (to be called “the noncritical groups” 
in the following discussion) he started the session 
with the brainstorming instructions quoted above. 
With the other three groups (to be called “the criti- 
cal groups”), he started the session with the follow- 
ing comments: 


No idea is ever worth anything unless it has 
been well thought out; in doing this you have to 
think about the problem and see it in all its as- 
pects. It is only when you do this that good ideas 
are produced. We want to have a number of ideas; 
but we want good, practical ideas. Let’s try to 
avoid stupid or silly ones. Also, it has been found 
that good ideas are sometimes created by improv- 
ing on other ideas and by combining two or more 
ideas into new ones. Are there any questions? Let 
me summarize the way we will operate: 1. Think 
clearly and logically. Try to see all the aspects of 
a problem. 2. We want good, practical ideas. 3. We 
want a number of ideas from which to later judge 
the better ones; however, the emphasis is on qual- 
ity not on quantity. 4. Combination and improve- 
ment of ideas is desired. 


Thus it was attempted to keep the working condi- 
tions of the critical groups as similar as possible to 
the ones of the noncritical groups (e.g., Point 4 of 
the instructions was identical for all groups) with 
one exception: the noncritical groups were instructed 
to rule out criticism while the critical groups were 
instructed to exert it. 

Following the two kinds of introductions all 
groups were given the following explanation of their 
task: 


One of the factors involved in marketing a 
product is the brand name. Some names have an 
appeal to most of the public, others do not. We're 
going to create some names for three products that 
would attract most of the men students here at 
Purdue. The purpose behind a brand name is to 
make it as attractive as possible so that the poten- 
tial consumers will want to buy the product. Let’s 
assume that a new cigar is to be put on the market 
which is designed for young men. Let’s create some 
names that will attract the Purdue male. 


After 10 minutes the group was asked to stop. 
Then the next task, finding a brand name for a 
deordorant that will attract the Purdue male was 
introduced in a similar manner. Finding a name for a 
car that will attract the Purdue male was the last 
problem to be solved. Ten minutes were allotted for 
each of the three products. A pilot study had sug- 


gested that any additional time allotment would re- 
sult in a negligible number of additional ideas. 

Evaluation. Next, the brand names were evaluated 
by 150 Purdue male undergraduate students none of 
whom had participated in the original experiment. 
It would have required too much of each judge's 
time to rate the total 902 different brand names 
Therefore, each judge was asked to rate only about 
one third of the total number of responses: this 
allotment of brand names to the judges was done in 
such a way that each name was rated by 50 judges. 
The names were presented in a different random 
order for each judge. 

The rating was performed on a five-point scale ac- 
cording to the following written instructions to the 
judges: 


Brand names are an important consideration in 
selling a product. Some names are appealing to 
most of the potential customers, others are not. 
This afternoon, we would like you to rate some 
names for a product. Assume that you wished to 
buy the product; then rate the names according to 
how much it attracts you. As you can see on the 
first page of your sheets, A means the name has a 
VERY GREAT ATTRACTION, B—a GREAT 
ATTRACTION, C—SOME ATTRACTION, D— 
LITTLE ATTRACTION, and E—VERY LITTLE 
ATTRACTION. Please place a circle around either 
the A, B, C, D, or E according to how much the 
name attracts you. Please work quickly and rate 
according to your first impression of the name 


On the basis of this evaluation a quality score was 
computed for each brand name, defined as the total 
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Fic. 2. 
noncritical responses 
product: Deodorant. 


Cumulative frequencies of the critical and 
according to quality of the 


weight of the A and B ratings, whereby a weight of 
five was given to the A ratings and a weight of four 
to the B ratings. The reliability of the scores was 
determined by dividing the 50 raters for each group 
of responses into two randomly selected groups of 25 
raters each and by correlating the ratings performed 
by these two subgroups of raters. The overall Pear- 
son product-moment correlation for all responses was 
72, significant at the 1% level 


ANALYSIS, RESULTS, AND DiIscUSSION 


The responses obtained under the critical 
and noncritical condition were compared as to 
quantity as well as quality. 

For each product, the noncritical condition 
produced a greater number of responses than 
the critical condition. The probabilities that 
differences, equaling or exceeding the ob- 
ones, between the number of non- 
critical and critical responses, would occur by 
chance, were less than .003°%, for each of the 
three products. 

Figures 1, 2, and 3 show the cumulative 
frequency distributions of the quality scores. 
As can be seen, the relationships between the 
distributions of the noncritical and the critical 
names are very similar for the three products. 

The figures suggest that there is no appre- 
ciable difference between the number of re- 


serve 


sponses given by the two groups at the left 
end of the distribution where the quality of 
the responses is extremely high. With decreas- 
ing quality the difference increases. Thus, the 
preponderance of responses from the non- 
critical group over the ones from the critical 
group owes its origin to the large number of 
responses of relatively low quality in the non- 
critical group. 

As can be seen from Table 1, the mean 
quality score of the critical responses is 
greater than the mean quality score of the 
noncritical responses for all three products, 
and this difference is statistically significant 


beyond the 5, 1, and 0.5% level, respec- 


tively, for “Automobile,” “Deodorant,” and 
the total of all responses. 


Table 2 gives further indications of the 
qualitative superiority of the critical re- 
sponses. It indicates that the critical responses 
received relatively more quality scores above 
the overall mean than the noncritical re- 
sponses. 

Figures 4, 5, and 6 show the cumulative 
relative frequency distributions of the two 
groups of responses. As can be seen, the rela- 
tionships shown in Figures 1, 2, and 3 are 
drastically changed when the total number of 
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TABLE 1 
COMPARISON OF THE MEAN QUALITY SCORES OF THE NONCRITICAL 
AND CRITICAL RESPONSES 


Standard 
Condition N Mean SD Error t 


Product 


NC 41.49 
Cc 65 45.26 27.83 4.07 93 


Cigar 


Deodorant NC 218 38.44 25.94 
80 48.14 


2.46** 


Automobile NC 326 45.40 26.59 
Cc 83 


51. 1.68* 


Potal NC 665 42.41 26.07 
48.78 


* Significant at 5‘) level (one-tailed test) 
at 1°) level (one-tailed test) 
at 0.5% level (one-tailed test) 


. responses for both response groups is con- small number of responses is required. There 
trolled. There is a tendency for the critical the two methods tend to be equally promis- 
curve to exceed the noncritical curve except ing, with a slight advantage of the critical 
} for responses of extremely low quality. method. The cumulative number of noncrit- 
gE, What practical conclusions can be drawn __ ical responses begins to be significantly higher 
‘ from these comparisons, assuming that their than the cumulative number of critical re- 
: validity may extend beyond the scope of this sponses within and below the quality intervals 
a study? with the midpoint 48 for “Cigar,” 58 for 
i The fact that the noncritical curve tends to 
Pi: rise above the critical curve except for a small 
E number of responses of highest excellence Lor O— Non-critical 
ie (Figures 1, 2, 3) has important practical im- 9 * --Critical 
‘ plications. If it is the purpose of problem , 
rk solving to produce a specific number of ideas 8 
of highest possible quality, these best ideas 
; will tend to be of higher or equal quality 7 
he when the noncritical method is used; this is 
the case because a horizontal line will tend to 6 


cut the noncritical curve at a point closer, or 
equally close, to the ordinate than the point 
at which it cuts the critical curve. A possible 
exception to this rule occurs when a very 
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COMPARISON OF RELATIVE FREQUENCY OF RESPONSES 
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378 378 Fic. 4. Relative cumulative frequencies of the non- 
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Fic. 5. Relative cumulative frequencies of the non- 
critical and critical responses according to quality 
of the product: Deodorant. 


“Deodorant,” and 91 for “Automobile” (be- 
yond the 1° level). However, we must take 
into consideration that it is more time con- 
suming to judge a large number of noncritical 
responses than a smaller number of critical 
responses as to quality; it takes longer to 
identify the responses of high quality among 
the noncritical responses than among the crit- 
ical responses. To what extent the disadvan- 
tage of this added time investment would 
detract from the advantages of the noncritical 
method would depend on the specific. sit- 
uation. 

On the other hand, according to the results 
of this study, the advantages of the noncrit- 
ical method appear to be based mainly on the 
larger number of responses produced. Tables 
1 and 2, and Figures 4, 5, and 6 indicate that 
the data support a decision in favor of the 
critical method if the number of critical re- 
sponses can be increased to equal the number 
of noncritical responses. However, it must 
again be taken into consideration that it 
would require more time, or more subjects, to 
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critical 


Relative cumulative frequencies of the non 


and critical responses according to qualit 


of the product: Automobile 


produce a given number of responses by the 
critical method than to produce the same 
number by the noncritical method. 

Finally, it should be kept in mind that the 
results of this study are a function of the 
criteria by which the responses were eval- 
uated. If the evaluation is based on ratings 
the relative merit of brainstorming might de- 
pend on a variety of attitudinal character- 


istics of the raters. The fact that such con- 
ventional names as “Sportsman,” “Esquire.” 
or “Century” were among the three most 


highly-rated brand names suggests that the 
imaginative ideas of the brainstormers were 
wasted on the conservative taste of the judges 
Perhaps judges and consumers with a greater 
preference for unusual brand names could 


throw a more favorable light on_ brain- 
storming. 
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OUTPUT RATES AMONG MACHINE OPERATORS: 


III. A NONINCENTIVE SITUATION IN TWO LEVELS 
OF BUSINESS ACTIVITY 


HAROLD F. ROTHE ann CHARLES T. NYE 


Fairbanks, Morse and Company, Beloit, Wisconsin 


Previous studies (Rothe, 1946, 1947, 1951; 
Rothe & Nye, 1958, 1959) have led to some 
hypotheses relating the consistency of output 
of industrial operators to the adequacy of the 
financial incentives in their work situations. 
These hypotheses are: 


{1] . .. the incentives to work may be considered 
ineffective when the ratio of the range intra- 
individual differences is greater than the ratio of the 
range of interindividual differences (Rothe, 1946, p 
326). 

[2]... if the intercorrelation of output rates for 
two periods closely related in time is less than .70, 
the incentivation is not highly effective, while inter- 
correlation higher than .80 indicates effective incenti- 
vation (Rothe & Nye, 1958, p. 185). 


of 


This paper presents output data on other 
groups of industrial operators and relates 
these data to those hypotheses. These data 
lend some support to one of the hypotheses, 
and they tend to weaken the other one. These 
data are interesting in their own right partic- 
ularly because there are two sets of data ob- 
tained from one plant at two different times 
and in two different levels of business activity. 


BACKGROUND OF THE STUDY 


The employees were machine operators in Plant C 
to distinguish it from Plants A and B, previously 
reported). Plant C is a rather small plant in a city in 
northern Illinois where there are several large plants 
The employees performed work generally similar to 
the work done in Plants A and B. They ran a variety 
of metal machine-tools, and standard production 
data were available. They were paid an hourly rate 

no incentive—and data were available for an 11- 
week period in 1958 and a 12-week period in 1960. 
Unlike Plants A and B, Plant C was a fairly new 
plant, with new equipment. Just as Plants A and B 
undergoing economic conditions the 
these studies were made, Plant C also 
poor economic conditions in In 
1960 Plant C was enjoying favorable business con- 
ditions 

Plant C was particular interest 
though it had no financial incentive system, 
had a practi 


were at 
was 


1958 


poor 
time 
undergoing 


of because, al 
it had 


of insisting that all employees reach a 


50 


certain amount of standard production or be dis- 
ciplined—perhaps discharged. Language in the collec- 
tive bargaining agreement permitted this practice. As 
a matter of fact, this practice was said to be in effect 
at the time of the study; but as the data reveal, this 
was not true. The standard production of each em- 
ployee was posted daily, but there was no other 
reward for high production, nor discipline for sub- 
standard production 


DATA 


The weekly average output (in which 100 
equals standard) and the number of em- 
ployees for whom data were available each 
week of the 1958 study are shown in Table 1. 
The weekly efficiencies varied from 90.13% 
to 94.29° and showed no trend with time. 

The weekly average output and the number 
of employees for whom data were available 
each week in 1960 are shown in Table 2. In 
Table 2 the weekly efficiencies varied from 
92.91% to 96.01% and again showed no 
trend. The average output was higher in 1960 
than in 1958 (95.2 versus 92.5), and although 
the number of employees varied somewhat 
from week to week during each period, there 


TABLE 1 

WEERLY AVERAGE OutTpuT (PERCENTAGE 

PERFORMANCE OF STANDARD) FOR GROUP 
OF MACHINE OPERATORS, 1958 


Number of 
Employees 


Percentage 
Performance 


Week Ending 


93.24 
91.37 
91.06 
91.83 
93.47 
94.29 
93.41 
93.79 
91.87 
93.45 
90.13 


April 27 
May 4 


June 
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were many more employees in 1960 than 
there were in 1958. This latter occurrence 
indicates that business was much better in 
1960 than it was in 1958. 

In all studies of output rates in this series 
the distributions of the rates of the employees 
for any week were approximately normal, al- 
though the distributions for the machine op- 
erators in Plant B were slightly skewed 
(Rothe & Nye, 1959). In Plant C most of 
the distributions for the 1958 period were 
distinctly not normal; for the 1960 period the 
distributions were more nearly approaching 
normality. In fact, the shapes of the distribu- 
tions are probably the most interesting parts 
of this study. For that reason the complete 

‘distributions of all the obtained data are pre- 
sented here in Table 3 for 1958 and Table 4 
for 1960. 

The correlation of the employees’ perform- 
ances for 1 week with their performances for 
the following week was calculated by the 
Pearsonian r method. The median r thus ob- 


Machine Operators 


TABLE 2 
AVERAGE Output (PERCENTAGH 
PERFORMANCE OF STANDARD) FOR GROUP 
1960 


WEEKLY 


OF MACHINE OPERATORS, 


Number of 
Employees 


Percentage 
Performance 


Week Ending 

Jan 96.01 
94.46 
96.54 
95.69 
95.01 
95.74 
94.62 
92.99 
95.04 


95 44 


95.49 
94.26 


tained was .48 with a range of .29 to .S8O for 
1958. The median ¢ for 1960 was .53 with a 
range of .17 to .72. 


TABLE 3 


FREQUENCY 


OPERATORS FOR ALL ELEVEN WEEKS, 


Percentage 
Performance 
to Standard 


April 


110 
105 
100 
95 
90 
85 
80 
75 
70 
65 
60 
35 
50 
45 
40 
35 
30 
25 
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DISTRIBUTIONS OF WEEKLY 


AVERAGE OUTPUT OF 
1958 


Week Ending 
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TABLE 4 


FREQUENCY DISTRIBUTIONS OF WEEKLY AVERAGE OvuTPpUT OF ALL OPERATORS 
FOR ALL TWELVE WEEKS, 1960 


Percentage 
Performance 
to Standard 10 17 


January 


115 
110 
105 
100 
95 
90 
85 


45 
40 
No of 


Operators 


The ratio of the range of intra-individual 
differences was obtained here, as in the earlier 
studies, by comparing the output for the most 
productive week with the output for the least 
productive week for each employee. The me- 
dian of these ranges was 1.18 in 1958 and 
1.14 in 1960. The ratio of the range of inter- 
individual differences was also obtained in the 
usual manner by comparing the performances 
of the most productive employee and least 
productive employee for each week. This ratio 
was 1.96 in 1958 and 1.57 in 1960. 


DISCUSSION 


In Plant C, where no financial incentive 
plan was in effect, but where, supposedly, 
substandard employees were disciplined, the 
week-to-week correlations of output rates 
were .48 and .54, which tends to support one 
of the two hypotheses previously proposed. 

The ratios of the ranges of inter- and intra- 
individual differences are opposite to what 
would be expected according to the other 
hypothesis regarding the effectiveness of in- 


Week Ending 


February March 


i4 21 : 13 20 


& 


centives. That is, the interindividual ratios 
exceeded the intra-individual ratios for each 
year, and this relationship has been hypothe- 
sized as indicating effective incentivation: 
This hypothesis clearly needs more evidence 
or more refining since it is strongly suspected 
that the incentivation in this situation was 
not very effective. 

The most striking feature of the present 
study was the skewness of the distributions of 
output, particularly in 1958. This is shown in 
Table 3. This phenomenon occurred during a 
period of poor business when employees were 
being laid off and when the average produc- 
tion was only 92.5°> performance to stand- 
ard. In 1960, when business was better and 
employees had been recalled to work, the 
average production rose to 95.2 and the varia- 
bility and skewness of the weekly distributions 
were reduced as the instances of extremely 
low productivity practically disappeared. 
Skewness of output has long been hypothe- 
sized as indicating “restriction of output.” 
Ford (1931) and Yoder (1942) suggested 
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that negatively skewed distributions indicated 
restriction, but they had no data to support 
their hypotheses. Bliss (1931) using data 
from some “bench workers” found output to 
be skewed positively, and he attributed this 
to “lack of motivation.”” More recently Gaudet 
and Livingston (1959) wrote that a nega- 
tively skewed distribution is often found for 
incentive piece workers, and this indicates 
“restriction of output.” They also present no 
data to support this view. 

In the light of all this theoretical discussion 
of skewness of output distributions, it is per- 
haps all the more noteworthy that the present 
study is the only one in the authors’ series of 
studies of output rates in which a clear and 
important skewness has been found. In fact, 
only in one previous study was there any 
indication of skewness, and in that one there 
Was an incentive system in effect, and it 
seemed to be effective (Rothe & Nye, 1959). 

It seems desirable to illustrate the skewness 
found here more effectively. For that purpose, 
data for the week of the largest number of 
employees of Table 3 have been plotted on a 
histogram which appears as Figure 1. 

It cannot be concluded definitely that the 
incentives to produce were ineffective here, 
but that is strongly suspected. One thing is 
certain: there was no financial incentive in 
effect. On the negative side, it is also certain 
that substandard producers were not disci- 
plined as they might have been (i.e., “pun- 
ished”). It is quite likely, therefore, that the 
incentives to work were not very great, and 
this was accompanied by a negatively skewed 
distribution of output rates, especially in 
1958. 

It was hoped to test Bedford’s hypothesis 
that both fast and slow producers have a 
common mode, that fast producers show less 
variation than slow producers, that the dis- 
tribution of output for fast producers is posi- 
tively skewed and the distribution for slow 
producers is negatively skewed, and that these 
phenomena may serve as an objective meas- 
ure of “restriction of output” (Bedford, 
1922). The distributions of the “best” five 
and the “poorest” five employees were plotted 
for each year, but there were insufficient data 
to permit the drawing of any conclusions. 
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Fic. 1. Output rates for 55 machine operators for 1 
week in a nonfinancially-incentivated shop, 1958 


This study of machine operators was char- 
acterized by strongly skewed distributions of 
output rates in 1958, and less skewed distri- 
butions in 1960. It tends to support one hy- 
pothesis previously presented, and to weaken 
another hypothesis. This paper presents, for 
the first time, significantly negatively skewed 
distributions of actual output rates. These oc- 
curred in a situation apparently accompanied 
by a lack of incentivation (either positive, as 
money, or negative, as disciplinary action). 
They occurred in a period of poor business 
conditions when employees were being laid 
off. Two vears later, when business had im- 
proved, and laid-off employees were recalled 
to work, the productivity was higher, had less 
variation, and was less skewed in distribution. 
This paper also shows, as do all papers in this 
series of industrial operators, the need for 
further studies of output, either as a phenom- 
enon that is inherently valuable, or because 
output is used as a criterion against which the 
effectiveness of other variables is sometimes 
measured. This paper further shows the need 
for describing the economic conditions or 
level of business existing at the time that in- 
dustrial studies are made. It supports the 
theory presented by other writers that a nega- 
tively skewed distribution indicates “restric- 
tion of output.” Unlike other papers which 
proposed this theory but had no data, this 
paper presents data showing negatively 
skewed actual output data in a situation 
where the incentives to work were probably 
not very effective. 
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THE NEWCOMER IN OPEN AND CLOSED GROUPS’ 


ROBERT C. ZILLER, RICHARD D. BEHRINGER, axp MATHILDA J. JANSEN 


Fels Group Dynamics 


While group stability has long been recog- 
nized as a major dimension in organization 
theory (Hemphill, 1949), relatively few lab- 
oratory experiments have been reported con- 
cerning group membership changes. The lab- 
oratory study reported here was designed to 
investigate the reactions to a newcomer by 
groups which had been apprized of various 
imminent membership changes (open groups) 
and groups which did not anticipate member- 
ship changes (closed groups). The study was 
also designed to explore the relative power of 
the newcomer and the regular group member. 

Simmel (Wolff, 1950, p. 402-408) and von 
Wiese and Becker (1932) have speculated 
extensively concerning the consequences of 
the addition of a new member to the com- 
munity or society. In general their proposi- 
tions describe the hostility which attends the 
advent of a newcomer to rather stable groups. 
Becker (1950, p. 50) also suggests that the 
newcomer is deeply disliked by the regular 
members since he represents the forces of 
change (also see Heider, 1958, p. 193--194). 
However, the afore-mentioned observations 
tend to concern implicitly only what is here 
referred to as closed groups. The open-closed 
group concept imposes a new dimension on 
the newcomer phenomenon and renders the 
earlier analyses incomplete. 


METHOD 
Subjects 


Sixty-four male social science students participated 
in the experiment as a requirement of their courses 
Each of the 32 three-person experimental groups was 
composed of two naive subjects and a confederate 


Procedure 


Two naive subjects and a confederate, ostensibly 
another naive subject, were seated around a standard 
note-passing device in which they were obscured 
trom each other’s view but could communicate by 
exchanging notes. 


‘This paper is an extension of a report presented 
to the Eastern Psychological Association meetings in 
Atlantic City, New Jersey, April 4, 1959. 


Center, University of Delaware 


In the first phase of the experimental procedure, 
the groups were involved in two tasks, a Meier Art 
Judgment item and a numerosity task. This phase of 
the experiment requiring approximately 25 minutes 
was ostensibly part of a group decision-making ex- 
periment but in reality was designed to facilitate the 
development of the group qua group ' 

Before the next and critical problem was intro- 
duced, the group was told that since the experiment 
also concerned the relationship between group size 
and the quality of group decisions, another member 
would be added to the group. The new member (an- 
other confederate) was introduced to the regular or 
core members and “instructed” as to the operation of 
the note-passing apparatus 

The third problem was again a numerosity task 
One thousand and fifty black dots were displayed on 
a white similar to a ping-pong paddle. The 
form was displayed for only When the 
members had submitted their individual estimates as 
to the correct number of dots, 
mitted “ten minutes in which to reach a group deci- 
sion as to the correct number of dots.” Finally, when 
the group estimate had been submitted, the individ- 
ual members submitted private estimates of the 
“number of dots they personally really thought there 
were on the card.” 


iorm 


seconds 


the group was per- 


Experimental Conditions 


The 2 X 2 factorial design allowed the comparison 
of the relative influence 
confederate) and a regular member (actually 
confederate) under open- and closed-group condi- 
tions. The open condition was created by stating at 
the outset of the experiment and again after the first 
task had been completed that “during the course of 
either a member will be asked to 
leave the group or a new member will join the 
group.” Under the closed conditions this statement 
was eliminated and it was presumed that the groups 
expected to remain intact during the entire session 

The experimental conditions designed to test the 
relative influence of a regular member (a confed- 
erate) and the newcomer (another confederate in the 
same group) were established by providing either the 
newcomer or a regular member with the correct 
inswer and procedure for arriving at that answer to 
the critical task. When the newcomer proposed the 
correct method and answer, the regular member pro- 
posed essentially the same method but a 
answer. The conditions were reversed when testing 
the relative influence of the regular member. All con- 
federates were enjoined to the “leadership” 
role, that is, the function of submitting the final 
group estimate. It should be noted that the task was 
of such a degree of complexity that the naive mem- 
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bers could neither estimate the correct number of 
dots with confidence nor determine or happen upon 
the correct approach. 


Measures 


The dependent variables included measures of 
communication direction (number of messages sent 
by the naive subjects to the newcomer and the core 
confederate), group influence, group satisfaction, and 
the relative ratings of the confederates by the naive 
group members. It was presumed that each of these 
measures reflected to some degree the extent to which 
the newcomer was assimilated by the original group. 

Two measures of influence were involved. The 
first was derived from the group mean (excluding 
the confederates) of the weighted responses to the 
following item: “I changed my individual estimate 
of the number of dots a great deal as a result of the 
group discussion.”” The alternatives were arranged on 
a six-point scale varying from “agree very much” to 
“disagree very much.” This index represents the in- 
dividual member's perceived change or influence. 

The second measure of influence was simply the 
difference between the group decision and the cor- 
rect answer. Because of the inherent positive skew- 
ness of the distribution of these scores, it was neces- 
sary to apply the cube root transformation.® 

An objective measure of the group's satisfaction 
with the group estimate was also derived from the 
dot data. The mean difference of the members’ post- 
group decision estimates from the group’s decision 
was calculated and again the cube root transforma- 
tion was applied. 

Finally, the naive members rated themselves as 
well as the confederates on nine-point descriptive 
scales regarding the following statements 

1. This person’s ideas with reference to the dot- 
estimate task were of the very highest quality 

2. This person is extremely pleasant and contrib- 
uted a great deal to making this session enjoyable. 

3. This exerted a_ very influence 
throughout the meeting. 


person great 


RESULTS 


The data were subjected to an analysis of 
variance described by C. A. Henderson (un- 
published report, 1959) and Federer (1955) 
for combined nested (analysis involving a 
comparison of individuals within the same 
group) and factorial classification. An ex- 
ample of the type of analysis is presented in 
Table 1. 

Examination of the results in Table 1 indi- 
cates that there is a significant difference be- 
tween open and closed groups (p < .01) in 


* These are the same measures used in a previous 
methodological study in this series (Ziller & Behr- 
inger, 1959). 

8H. Simon, personal communication, 1957 


TABLE 1 
ANALYSIS OF VARIANCE OF RATINGS 
OF PLEASANTNESS 


D Random 


Source Error Term 


25.0430 

4.6800 

9 6800 

0521 

AC 4.7663 
BC 
ABC 1.0372 
D, AB 4.4273 
CD, AB 2.3125 
E, DAB 2.9732 
CE, DAB 1.3661 


CD, AB 
D, AB 
CD, AB 
CD, AB 
CD, AB 
DAB 
CE, DAB 


Note “A” refers to the open-closed group dimension 
to the condition where the regular member (a confederate 
another confederate) possesses the correct ans 
mber (regular t r or newcomer); “I 
ratings by the naive ‘rs of the two confer 
ratings of the confederates by a single naive men 


ratings of pleasantness with reference to both 
confederates. The higher ratings appear in the 
open groups. In addition, when a less con- 
servative error term is used (CE, DAB) there 
is a significant difference between the ratings 
of regular confederates and newcomer con- 
federates (p< .05), the newcomer ratings 
being lower. 

In order to conserve space, the tables of 
analyses are omitted. However, with reference 
to “quality of ideas” the ratings of both con- 
federates were significantly higher again for 
open as compared with closed groups (/ 

.05), and higher for the regular confederate 
than the newcomer confederate (p < .05; CE, 
DAB, error term). With regard to “influ- 
ence,” again the regular confederate was rated 
significantly higher than the newcomer con- 
federate (p < .01; CE, DAB, error term). 

With regard to the results of the analysis 
involving communication direction, the inter- 
action effect of open-closed conditions and the 
rated confederates was found to be significant 
(p < .05). More notes were sent by the naive 
members to the newcomer in the open condi- 
tion: more notes were sent ‘to the regular 
member in the closed situation. 

The analyses of the objective measure of 
influence involving the accuracy of group 
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solutions did not yield significant results. 
However, the results involving the subjective 
measure of influence or estimated change by 
the naive group members were statistically 
significant (p < .01). Those groups in which 
the regular member possessed the correct 
solution reported less change than those where 
the newcomer was correct (p < .001). 

With regard to group satisfaction greater 
private agreement with the group answer was 
revealed under the open conditions as opposed 
to the closed conditions (p < .01; CE, DAB, 
error term). 


DISCUSSION 


At a very general level of abstraction, the 
results indicate that members of open groups 
react in a very different way than members 
of closed groups to the other members of the 
group and particularly to the newcomer. This 
rather general conclusion assumes greater 
trenchancy when it is recalled that the open 
groups differed from the closed groups only 
in the fact that the open groups had been 
informed at the time of their inception of 
some impending membership changes. 

Thus, in the open groups which were pre- 
sumed to be more task oriented and _ less 
interpersonally oriented, the members with 
the most superior methods of approaching the 
task and the correct answer (the newcomer as 
well as the regular confederate) were more 
accepted by the members of open groups than 
by closed groups. 

However, a further differentiation between 
the newcomer and the regular members comes 
to light with regard to the number of notes 
received by these respective confederates. In 
terms of assimilation, the results indicate that 
the newcomer is admitted and integrated with 
greater facility by the open groups than by 
the closed groups. 

It is proffered that under open conditions 
or conditions of social metabolism where the 
elements of a complex are temporarily inter- 
related (due to the anticipated replacement, 
removal, or addition of members), constant 
adjustment to the specific characteristics of 
the changing membership as well as to the 
changing roles and status hierarchies is per- 
ceived by the members as dysfunctional. It is 
recognized that energy is only dissipated in 


Newcomer in Open and Closed Groups 


an endless effort to establish transitory inter- 
personal relationships. 

In open groups in contrast to closed groups, 
measured performance of the group as a 
whole offers a more durable basis for group 
structure. Accordingly, open groups are less 
concerned with who actually performs the 
task. Task accomplishment is a sufficient con- 
dition for member satisfaction (Cartwright & 
Zander, 1953, p. 310). 

In the event of a newcomer, a redistribu- 
tion of prestige symbols and social cathexis is 
less necessary in open groups. In addition, the 
newcomer may be perceived by the members 
of open groups as a resource for increasing 
the probability of group success rather than 
as a restrictive and/or disruptive supernumer- 
ary threatening to attenuate the shares of the 
limited interpersonal or prestige rewards 
within the group. Or perhaps very simply, the 
newcomer enters a far more complex, inter- 
related system of elements in closed groups 
than in open groups, and assimilation is ac- 
cordingly accomplished with diminished facil- 
ity and increased hostility—at least during 
the initial phases of the assimilation process. 

It is compelling to speculate briefly con- 
cerning the implications for small group re- 
search and social science in general of these 
results involving the open-closed group con- 
cepts. It is all too clear that small group 
experiments have been concerned almost ex- 
clusively with and products of 
closed-reaction systems, systems in which the 
forces tend to move toward a state of equilib- 
rium (see Heinicke & Bales, 1953). However, 
this traditional closed model is scarcely iso- 
morphic with regard to social systems which 
also concern the minatory aspects of illness, 
death, and change of group composition in 
general. 

A basic characteristic of our social systems 
is that they maintain themselves in a state of 
perpetual change of components. It is true 
that within a given time span the composition 


processes 


of the social system appears static. However, 
the perception is nonveridical. We are de- 
ceived by our own parallel movement along 
the flow of time. Moreover, perpetual changes 
involving the components of the system are 
obscured by the relatively “steady state” of 
the system of the molar level. 
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The relatively slow rate of change in com- 
position of most social groups may also tend 
to obscure the phenomenon. If the rate of 
change of elements in a social group were to 
be markedly increased however, the conse- 
quences would not only become manifest but 
obtrusive if not chaotic. Perhaps, in fact, 
there exists an optimum rate of change for 
various objectives of the group. 

A second aspect of the experiment concerns 
the naive members’ reactions to the newcomer 
as compared with the core confederate. In 
this respect, the “oldtimer” was rated higher 
under all conditions with regard to “quality 
of ideas,” “pleasantness,” and “influence” 
(see Table 1). Thus, even for a relatively 
short group “history,” the regular members 
appear to react more positively to the longer 
term member. Tenure and status appear to be 
positively related. 


SUMMARY 


The dimension of group stability was in- 
vestigated in a laboratory setting. The assim- 
ilation process of a new member was studied 
in open groups (groups which had been ap- 
prized of various impending membership 
changes) as compared with closed groups 
(groups which did not anticipate membership 
changes). The study was also designed to ex- 
plore the relative power of the newcomer and 
the regular group member. The experimental 
design required two confederates, one who 
participated with the group from the outset, 
and another who joined the group after the 
group had performed two decision-making 
tasks. In each experimental group, either the 
newcomer or a second confederate held the 
correct answer and a correct method for arriv- 
ing at that answer. The group interacted 
through the medium of a standard note- 
passing apparatus. 

Under the open conditions in comparison 
with the closed conditions the naive members 


were more positively disposed towards the 
confederates, rated the ideas of both con- 
federates higher, and were more satisfied (on 
an objective criterion) privately with the 
group solution. Moreover, under the open 
conditions, more notes were remitted to the 
newcomer confederate than to the regular 
confederate; while under the closed condi- 
tions, more notes were submitted to the 
regular confederate than to the newcomer 
confederate. These results were interpreted as 
indicating that open groups tend to be less 
interpersonally oriented but more task or- 
iented than closed groups. 

Finally the regular members tended to 
react more positively to the longer term 
member; that is, tenure and status 
found to be positively associated. 


were 
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Traditionally, two different research de- 
signs have been employed by investigators in 
their use of biographical data. These are 
directly analogous to the present-employee 
and follow-up methods of test validation. 
Exemplifying the latter are “weighted appli- 
cation blank” studies such as those of Kirch- 
ner and Dunnette (1957), Minor (1958), 
and Navlor and Vincent (1959). In this 
approach, the pool of items for validation 
consists of those that appear on the em- 
ployer’s application form, filled out at the 
time of hiring by the criterion groups. Typ- 
ically, the criterion used is job tenure, so that 
items are retained which discriminate the long 
and short tenure individuals. A validity co- 
efficient based on the composite of surviving 
items would be a predictive validity estimate. 
The nature of the items retained, using this 
design, is almost always strictly demographic 
(age, marital status, number of dependents, 

In the other design, which we will term the 
“personal history” approach, an instrument 
covering various background topics is admin- 
istered to presently employed individuals who 
have been segregated into criterion groups. In 
contrast to the weighted application blank 
design, a diversity of criterion measures has 
used here; for example, the “human 
relations” effectiveness of industrial supervi- 
sors (Edgerton, Feinberg, & Thomson, 1957), 
gross earnings of automobile salesmen (Ken- 
nedy, 1958), and creativity of machine de- 
signers (Owens, Schumacher, & Clark, 1957). 
Validity coefficients in these studies are con- 
current validity estimates. Whereas, the 
weighted application blank is generally lim- 
ited to factual items, the personal history 
questionnaire can include additionally items 
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covering preferences, attitudes, and interpre- 
tation of experience (Rundquist, 1950). 

The present study involves the attempt to 
validate an extensive personal history form 
with petroleum research scientists, using three 
different criteria of success in research work. 


METHOD 


Subjects. The Ss for this study were volunteer 
male Standard Oil Company’s 
largest research laboratory. All of them were college 
graduates, the majority possessing advanced degrees. 
The most common areas of educational specialization 
were chemistry and chemical engineering with a few 
individuals having degrees in other technical 
such as mathematics, physics, etc. The length of serv- 
ice of the group ranged from less than 1 year to over 
20 years, the heaviest single concentration (30%) 
occurring in the 1- to 5-year interval. Seventy-six 
percent were less than 40 years of age. The number 
of Ss varied from 331 to 198 missing 
predictor or criterion data 

Predictor. The personal history questionn 
was tailor-made for this study 
multiple-choice items. The 
from various sources. The content of the question- 
naire included home and family background, 
aspects of present and previous jobs 
ests, school and college activities, et: 

Criteria 
the study 


employees at the 


areas 


because of 


iire used 
and consisted of 484 
items 


were assembled 
various 
athletic inter 
The criterion which was used initially in 
was a rating of the researchers on over-all 
These 
administrative pur- 
they were not collected specifically for this 
investigation. The rating scheme used is a seven-step 
forced-distribution system, such that ratings of 1 and 
7 represent the high and low ends of the distribution, 
respectively. The percentages of Ss at each point on 


job performance made by their supervisors 
ratings are made periodically for 


poses; 


the distribution are as follows: 1, 10%; 2, 15%; 3, 
15%; 4, 20%; 5, 15%; 6, 15%; 7, 10%. For pur- 
poses of item analysis, the 20% with ratings of 4 
were eliminated, leaving the high and low 40% as 


criterion groups. Some evidence for the reliability of 
this criterion is the fact that any man’s rating is the 
product of the judgment of several different super- 
visors. Also, it is reported that relatively little shift 
in rating occurs from vear to year for most people 
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Some time after the project was underway, it be- 
came possible to collect ratings regarding the creativ- 
ity of the Ss. The rating instrument used was the 
Check-List Rating Scale for Creativity: Form C-1, as 
developed by Taylor (1958). This is a Thurstone- 
type scale of 24 descriptive items. The rater checks 
only those items that apply to or describe the ratee. 
The range in favorableness from “He 
shows signs of being one of the most creative men 
in this work that I have known” to “He never has 
an idea of his own to suggest.” The reliability of the 
scale is suggested by Tavlor’s correlations, which 
averaged .83, between the creativity checklist ratings 
and descriptive ratings of the same individuals made 
4 to 5 months later on “originality.”” In accordance 
with Jurgensen’s rationale (1949), the form was 
scored by computing the algebraic sum of the scale 
values instead of using the median scale value, as 
Taylor did.! On this basis, individuals with checklist 
scores of —160 to —10 made up the low criterion 
from +50 to 


statements 


group and those whose scores ranged 
+180, the high group 

The third criterion 
patent disclosures filed by 
year period 1954 to 1958 inclusive 
closure is a document written by the 
which he 
received for 


the number of 
during the 5 
(A patent dis- 
technical man 
as patentable 
the disclosure, it must 
patent advisory com- 
further processing it may eventually 
application.) This 
span was chosen after an examination of the distri- 
bution of disclosures showed that this period of time 
included data to be reliable 
Only those Ss were included in this part of the study 

5 who 


were 


consisted o 


each man 


describing an idea regards 


Before credit is 


be screened by an in-company 
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become a formal patent 5-year 
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about one per 
the units during spare moments at their work places 
or to take them home if they 
to be returned before the 
given.” Due to the voluntary 
normal turnover, travel schedules, et 
in numbers of Ss took 

For the item 


subdivided 


wished, One unit was 


next was 
project, 
attrition 
the study. 


completed and 
nature of the 
some 
during 


inalysis, all criterion groups were 


randor into validation and _ cross- 


validation The numbers of Ss in each art 


Both the 


scormeg 


median value and algebrai 


procedures were us¢ the two sets o 


correlate .924. The Jurgens: 


therefore, more on rational grounds 
basis of statistical superiority 
Administrative 


pec ially H R 


collection, as 


The writers are grateful to the 
Division at the 
Ahlberg. his 


well as for 


tesearch Laboratory, « 
ssistance in the data 
other help rendered 


during the study 


given in Table A.“ Ns ranged from 25 to 68, but 
were approximately equal for high and low samples 
for a given criterion and content unit. Using a pro- 
gram developed tor the IBM 705, the significance of 
the difference in response percentages of the high and 
low groups was computed for every re- 
sponse, validation and cross-validation samples sepa- 
rately. A scoring key was developed for each crite- 
rion from items which discriminated in both 
the validation and cross-validation samples at or 
beyond the .10 level, making a compound probability 
requirement of .05 or The keyed items were 
given weights of +1 or —1 in accordance with the 
direction of discrimination 

The 
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those 


less 


concurrent validity of these keys was esti- 
the papers of 100 Ss 
randomly from the pool of Ss having complete per- 
sonal history and criterion data the Ss in 
this “scoring sample” were middle criterion individ- 
uals never used in the item analyses, but others had 
been in the keying samples. An empirical study 
Albright, Smith, Glennon, & Owens, in press) 
pared the validity this ap 
parently contaminated obtained 
using the conventional 
Katzell, 1951). 


‘contaminated” 7 was very 


scoring selected 


Some of 


com 
obtained by 
method with 
double cross-validation 
Contrary to expectation, the 
close to the larger of the 
two sample r’s, which Katzell proposed as the better 
estimate of the true validity of the composite 
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RESULTS 


Representativeness of subjects. The fact 
that the Ss were volunteers raised a question 
as to their representativeness of the entire 
professional staff in the laboratory. A check 
of representativeness was made by testing the 
distribution of obtained over-all performance 
that 
forced-distribution system. This was done for 


ratings against expected under the 


all five questionnaire units. None of the five 


values of y° was significant at the 5°% level, 


indicating that the sample was not 
tionately weighted with Ss from one or an- 


dis} 


other rating category. This test was also made 
for the 100 randomly selected Ss in the scor- 
ing sample. with nonsignificant results. 
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Intercorrelations of criteria. Table 1 pre- 
sents the intercorrelations of the three crite- 
ria. These values are based on the total num- 
bers of Ss for whom the criterion data were 
available, regardless of whether they filled out 
the personal history questionnaire. As might 
be expected, the two sets of ratings correlate 
more highly with each other than either does 
with the patent disclosure criterion. However, 
all three of the coefficients shown are signifi- 
cant at the .01 level. 

Validity and reliability of scoring kevs. 
When the three sets of item weights were 
applied to the papers of the scoring sample, 
the results shown in Table 2 were obtained. 
As can be seen, the concurrent validity of the 
37-item over-all performance key was .613. 
Correspondingly, for the creativity and pat- 
ent disclosure criteria the validities are .521 
and .517, respectively, both based on 22 items. 
All of these coefficients far exceed the value 
required for significance at the .01 level. 

The number of items shown for each key 
may be somewhat misleading because the 
same item may discriminate on more than one 
criterion (although the same response is not 
necessarily scored each time). In fact, three 
items are included in all three keys. A total 
of 59 different items are scored. 

An estimate of the reliability of some of 
these scored items was obtained when 25 re- 
searchers consented to retake two of the per- 
sonal history units about 2 months after their 
original responses had been submitted. These 
two units contained 29 of the 59 discriminat- 
ing items. The correlation of the 29-item 
scores obtained on the two administrations 
was .683. This value is similar in magnitude 
to those reported by other investigators 
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(Fiske, 1947; Guilford, 1950) with very larg 
numbers of Ss. 


DISCUSSION 


The relatively high validities of the three 
scoring keys demonstrate the utility which the 
personal history technique can have with 
highly skilled individuals. (It is acknowledged 
that these coefficients are doubtless somewhat 
higher than those which would be obtained 
with an entirely new sample of Ss; but 
stated previously, the writers’ earlier study 
in press, suggested that the amount of infla- 
tion was insignificant.) As is often note 
aptitude and other types of tests which worl 
well at lower ability levels may fail to dis 
criminate in groups of this kind. Also 
would be difficult to find another instrument 
which would yield a better “rate of return 
in predictive effectiveness per unit of testing 
time. For example, most applicants can easil 
complete the 59 keyed items in 30 minutes 

The only other studies in this general are 
of which the writers are aware are those by 
Mandell (1950), Stein (1957), and Tay! 
(1958). Mandell, using a 37-item inventory 

1; 


was unable to discriminate chemists rate 


high and low on iob nerformance by the 
superiors and colleagues. Taylor found 


20 out of 47 personal history items were 


nificantly related to either or both of two cri- 


teria for 94 electronics engineers and scie! 
tists. The criteria were ratings of productivity 


and creativity. However, when he extended 


the scoring keys to a group of physicists. only 


chance relationships obtained. Stein reports 
a number of biographical variables which dif- 
ferentiate significantly between his groups of 
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“more” and “less” creative industrial research 
chemists. There were 23 individuals in each 
group, the groups being determined by rank- 
ings and ratings from superiors, colleagues, 
and subordinates. No cross-validation data are 
reported. 

Although some of our “personal history” 
items are similar in nature to those found in 
tests of personality, interests, and values, it 
is felt that they engender less hostility on the 
part of the applicant and are less subject to 
faking than is characteristic of many such 
tests. For one thing, they are presented in 
context with other items which do require 
responses of a strictly factual nature. Also, 
the applicant may do himself as much harm 
as good by trying to appear the “Organization 
Man,” due to the empirical weighting pro- 
cedure used. 

A review of the discriminating items sug- 
gests that a “‘self-confidence” factor may be 
what these items assess. This interpretation is 
reinforced by the frequency with which the 
high criterion groups say that they (a) have 
more readily taken advantage of opportunities 
presented them, (0) consider their achieve- 
ments thus far to be greater than those of 
others with the same education, (¢c) work 
more quickly than others, and (d) prefer to 
have many things “on the fire’ simultane- 
ously. Also the high groups tend to have more 
education than their colleagues, have obtained 
it on scholarships or fellowships, have worked 
as teachers or instructors, have published at 
least one technical paper and devote much 
time to reading—suggesting the presence ad- 
ditionally of an “academic orientation” factor. 


SUMMARY 


A personal history questionnaire was ad- 
ministered to a group of petroleum research 
scientists employed in a research laboratory 
of the Standard Oil Company. The question- 
naire items were analyzed and cross-validated 


three criteria of research effective- 
ness: ratings of over-all performance, ratings 
specifically on creativity (both made by su- 
pervisors of the researchers), and objective 
records of the number of patent disclosures 
produced by each man during a 5-year period. 


against 


Three keys, totaling 59 items, were applied 
to a scoring sample of 100 randomly selected 
Ss. The concurrent validity estimates resulting 
were .613, .521, and .517 for over-all perform- 
ance ratings, creativity ratings, and patent 
disclosures, respectively. These results were 
interpreted as indicative of the value of the 
personal history technique with a highly select 
group. 
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A complete library 
of psychology in 


one volume — 


CLASSICS IN 
PSYCHOLOGY 


Edited by Thorne Shipley 
In a text of over 1400 pages, are the works 
of 35 leading psychologists of Europe, the 
United States, and the Soviet Union, span- 
ning a period of one hundred and _ fifty 
years. 


Some of the chapters: 
PHILIPPE PINEL: 


Treatise on Insanity 
JEAN MARTIN CHARCOT: 
Diseases of the Nervous System 
WILHELM WUNDT: 
Sensory Perception 
ERNST MACH: 
lhe Relation of the Physical to the Psychical 
WILLIAM JAMES: 
Principles of Psychology 
EMIL KRAEPELIN: 
Clinical Psychiatry 
E. B. TITCHENER: 
Structural Psychology 
BENJAMIN RUSH: 
The Diseases of the Mind 
MANFRED SAKEL: 
Shock Treatment of Schizophrenia 
EUGEN BLEULER: 
Dementia Praecox 
ALFRED ADLER: 
Individual Psychology 
I. P. PAVLOV: 
Conditioned Reflexes 
JOHN B. WATSON: 
Behaviorist Views 
HERMANN RORSCHACH: 
*sychodiagnostics 
AUGUST AICHHORN: 
Wayward Youth 
G. STANLEY HALL: 
Ade ence 
JEAN PIAGET: 
The Language and Thought of the Child 
KURT KOFFKA: 
Perception 
ISAAC RAY: 
Jurisprudence of Insanity 
WILLIAM McDOUGALL: 
Social Psychology 
JOSEF BREUER & SIGMUND FREUD: 
Studies of Hysteria 
Cc. G. JUNG: 
Analytical Psychology 


$20.00 


You can erpedite shipment by enclosing remittance 


PHILOSOPHICAL LIBRARY, Publishers 
15 East 40th Street. New York 16, N.Y. 
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A New Dorsey Teat 


READY IN JUNE 


STUDIES IN PERSONNEL AND 
INDUSTRIAL PSYCHOLOGY 


By EDWIN A. FLEISHMAN, Yale University 


This volume offers to the student an up-to-date, well-rounded, and integrated view 
of the field of personnel and industrial psychology. Each section of the book is 
organized around the key problems and issues in selection, performance appraisal, 
training, motivation and morale, leadership, organization, accident prevention, work 
conditions, and engineering psychology. Since the articles were selected with a 
view to cover adequately the key problem areas, the text achieves an outstanding 
balance of coverage in each major section. Each of the nine sections is introduced 
with original text material designed to review the subject matter. A sufficient num- 
ber of articles are available to enable the instructor to select for his particular class 
needs. 


WRITE FOR AN EXAMINATION COPY 


THE DORSEY PRESS, INC., HOMEWOOD, ILLINOIS 
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STUDIES IN ITEM ANALYSIS AND PREDICTION 


Edited by Herbert Solomon. The book consists of an integrated series of 
mathematical studies and presents many new theoretical developments in both 
test design and classification of individuals on the basis of responses to tests. 
The studies will be of use to psychometricians, to mathematical statisticians, 
and to educational psychologists as well, and for research in tests and measure- 
. ments. 

Stanford Mathematical Studies in the Social Sciences, V1. $8.75 


MARKOV LEARNING MODELS 
FOR MULTIPERSON INTERACTIONS 


Patrick Suppes and Richard C. Atkinson. This book represents a major at- 
tempt to bridge the gap between statistical learning theory and social psychol- 
ogy by applying stimulus sampling techniques to social interaction situations, 
and particularly to multiperson situations that have a game structure. 

Stanford Mathematical Studies in the Social Sciences, V. $8.25 


Order from your bookstore, please 


Ur Stanford University Press 
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Selected New Books in Psychology 


THE MOTIVATION 
OF BEHAVIOR 


By JUDSON S. BROWN, Univer- 
sity of Florida. McGraw-Hill 
Series in Psychology. 432 pages, 
$7.50. 


The chief emphasis of this carefully de- 
veloped analysis of motivation is on the 
utility of the motivation concept as an ex- 
planatory component of general behavior 
theory. The Hullian conception of drive is 
emphasized, with an examination of al- 
ternative theoretical interpretations. In- 
cluded are studies of animal motivation and 
a survey of research on human motivation. 
This text is interesting and intelligible for 
both advanced and beginning graduate 
students. 


ADJUSTMENT AND 
PERSONALITY 


By RICHARD S. LAZARUS, Uni- 
versity of California. McGraw- 
Hill Series in Psychology. 480 
pages, $6.50. 


This text provides thorough coverage of the 
basic issues and points of view in the psy- 
chology of adjustment. This book, viewing 
the process of adjustment as an inseparable 
component of personality, is written at a 
high level of scholarship and will prove 
stimulating and valuable over a wide range 
of undergraduate courses for majors in psy- 
chology, as well as non-majors whose in- 
terest extends beyond the introductory level. 


INTRODUCTION TO 
PSYCHOLOGY, 
Second Edition 


By CLIFFORD T. MORGAN, 
University of Wisconsin. Ready 
in April, 1961. 


This outstanding introductory text has been 
carefully revised. The writing, new design, 
format and materials have been coordinated 
to make it more challenging and rigorous. 
The text, as before, is broad in coverage and 
organized to meet the needs of courses with 
varying emphasis. It covers the traditional 
topics of general psychology and individual 
differences, as well as the physiological, 
social, applied, and clinical! areas. 


PSYCHOLOGY OF 
PERSONALITY, 
Third Edition 


By ROSS STAGNER, Wayne 
State University. 586 pages, 
$7.50. 


The third edition of an excellent text de- 
signed to provide the student with an intro- 
duction to current knowledge of the normal 
personality. Emphasis is placed on the 
author’s conception of personality develop- 
ment and organization, built around ho- 
meostasis and the process of perceiving. 
Alternative formulations are discussed. 
Topics include: attitudes and values, the 
family, the school system, class and eco- 
nomic factors, personality and social values, 
emotional foundations, etc. 


Send for copies on approval 


McGraw-Hill Book Company, Inc. 


330 West 42nd Street 


New York 36, New York 
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